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SELF-PROCESSING PLANTS AND PLANT PARTS 

Related Applications 

This application is a continuation-in-part of U.S. Patent Application No. 10/228,063, filed 
August 27, 2002, which claims priority to Application Serial No. 60/315,281, filed August 27, 
2001, each of which is herein incorporated by reference in their entirety. 

Field of the Invention 

The present invention generally relates to the field of plant molecular biology, and more 
specifically, to the creation of plants that express a processing enzyme which provides a desired 
characteristic to the plant or products thereof. 

Background of the Invention 

Enzymes are used to process a variety of agricultural products such as wood, fruits and 
vegetables, starches, juices, and the like. Typically, processing enzymes are produced and 
recovered on an industrial scale from various sources, such as microbial fermentation {Bacillus 
a-amylase), or isolation from plants (coffee /?-galactosidase or papain from plant parts). Enzyme 
preparations are used in different processing applications by mixing the enzyme and the substrate 
under the appropriate conditions of moisture, temperature, time, and mechanical mixing such 
that the enzymatic reaction is achieved in a commercially viable manner. The methods involve 
separate steps of enzyme production, manufacture of an enzyme preparation, mixing the enzyme 
and substrate, and subjecting the mixture to the appropriate conditions to facilitate the enzymatic 
reaction. A method that reduces or eliminates the time, energy, mixing, capital expenses, and/or 
enzyme production costs, or results in improved or novel products, would be useful and 
beneficial. One example of where such improvements are needed is in the area of corn milling. 

Today corn is milled to obtain cornstarch and other corn-milling co-products such as corn 
gluten feed, corn gluten meal, and corn oil. The starch obtained from the process is often further 
processed into other products such as derivatized starches and sugars, or fermented to make a 
variety of products including alcohols or lactic acid. Processing of cornstarch often involves the 
use of enzymes, in particular, enzymes that hydrolyze and convert starch into fermentable sugars 
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or fructose (a- and gluco-amylase, a -glucosidase, glucose isomerase, and the like). The process 
used commercially today is capital intensive as construction of very large mills is required to 
process corn on scales required for reasonable cost-effectiveness. In addition the process 
requires the separate manufacture of starch-hydrolyzing or modifying enzymes and then the 
machinery to mix the enzyme and substrate to produce the hydrolyzed starch products. 

The process of starch recovery from corn grain is well known and involves a wet-milling 
process. Corn wet-milling includes the steps of steeping the corn kernel, grinding the corn 
kernel and separating the components of the kernel. The kernels are steeped in a steep tank with 
a countercurrent flow of water at about 1 20° F and the kernels remain in the steep tank for 24 to 
48 hours. This steepwater typically contains sulfur dioxide at a concentration of about 0.2% by 
weight. Sulfur dioxide is employed in the process to help reduce microbial growth and also to 
reduce disulfide bonds in endosperm proteins to facilitate more efficient starch-protein 
separation. Normally, about 0.59 gallons of steepwater is used per bushel of corn. The 
steepwater is considered waste and often contains undesirable levels of residual sulfur dioxide. 

The steeped kernels are then dewatered and subjected to sets of attrition type mills. The 
first set of attrition type mills rupture the kernels releasing the germ from the rest of the kernel. 
A commercial attrition type mill suitable for the wet milling business is sold under the brand 
name Bauer. Centrifiigation is used to separate the germ from the rest of the kernel. A typical 
commercial centrifiigation separator is the Merco centrifugal separator. Attrition mills and 
centrifugal separators are large expensive items that use energy to operate. 

In the next step of the process, the remaining kernel components including the starch, 
hull, fiber, and gluten are subjected to another set of attrition mills and passed through a set of 
wash screens to separate the fiber components from the starch and gluten (endosperm protein). 
The starch and gluten pass through the screens while the fiber does not. Centrifiigation or a third 
grind followed by centrifiigation is used to separate the starch from the endosperm protein. 
Centrifiigation produces a starch slurry which is dewatered, then washed with fresh water and 
dried to about 12% moisture. The substantially pure starch is typically further processed by the 
use of enzymes. 
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The separation of starch from the other components of the grain is performed because 
removing the seed coat, embryo and endosperm proteins allows one to efficiently contact the 
starch with processing enzymes, and the resulting hydrolysis products are relatively free from 
contaminants from the other kernel components. Separation also ensures that other components 
of the grain are effectively recovered and can be subsequently sold as co-products to increase the 
revenues from the mill. 

After the starch is recovered from the wet-milling process it typically undergoes the 
processing steps of gelatinization, liquefaction and dextrinization for maltodextrin production, 
and subsequent steps of saccharification, isomerization and refining for the production of 
glucose, maltose and fructose. 

Gelatinization is employed in the hydrolysis of starch because currently available 
enzymes cannot rapidly hydrolyze crystalline starch. To make the starch available to the 
hydrolytic enzymes, the starch is typically made into a slurry with water (20-40% dry solids) and 
heated at the appropriate gelling temperature. For cornstarch this temperature is between 105- 
1 10° C. The gelatinized starch is typically very viscous and is therefore thinned in the next step 
called liquefaction. Liquefaction breaks some of the bonds between the glucose molecules of the 
starch and is accomplished enzymatically or through the use of acid. Heat-stable endo a - 
amylase enzymes are used in this step, and in the subsequent step of dextrinization. The extent 
of hydrolysis is controlled in the dextrinization step to yield hydrolysis products of the desired 

percentage of dextrose. 

Further hydrolysis of the dextrin products from the liquefaction step is carried out by a 
number of different exo-amylases and debranching enzymes, depending on the products that are 
desired. And finally if fructose is desired then immobilized glucose isomerase enzyme is 
typically employed to convert glucose into fructose. 

Dry-mill processes of making fermentable sugars (and then ethanol, for example) from 
cornstarch facilitate efficient contacting of exogenous enzymes with starch. These processes are 
less capital intensive than wet-milling but significant cost advantages are still desirable, as often 
the co-products derived from these processes are not as valuable as those derived from wet- 
milling. For example, in dry milling corn, the kernel is ground into a powder to facilitate 
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efficient contact of starch by degrading enzymes. After enzyme hydrolysis of the corn flour the 
residual solids have some feed value as they contain proteins and some other components. 
Eckhoff recently described the potential for improvements and the relevant issues related to dry 
milling in a paper entitled "Fermentation and costs of fuel ethanol from corn with quick-germ 
process" (A ppl. Biochem. Biotechnol. , 94: 41 (2001)). The "quick germ" method allows for the 
separation of the oil-rich germ from the starch using a reduced steeping time. 

One example where the regulation and/or level of endogenous processing enzymes in a 
plant can result in a desirable product is sweet corn. Typical sweet corn varieties are 
distinguished from field corn varieties by the fact that sweet corn is not capable of normal levels 
of starch biosynthesis. Genetic mutations in the genes encoding enzymes involved in starch 
biosynthesis are typically employed in sweet corn varieties to limit starch biosynthesis. Such 
mutations are in the genes encoding starch synthases and ADP-glucose pyrophosphorylases 
(such as the sugary and super-sweet mutations). Fructose, glucose and sucrose,which are the 
simple sugars necessary for producing the palatable sweetness that consumers of edible fresh 
corn desire, accumulate in the developing endosperm of such mutants. However, if the level of 
starch accumulation is too high, such as when the corn is left to mature for too long (late harvest) 
or the corn is stored for an excessive period before it is consumed, the product loses sweetness 
and takes on a starchy taste and mouthfeel. The harvest window for sweet corn is therefore quite 
narrow, and shelf-life is limited. 

Another significant drawback to the farmer who plants sweet corn varieties is that the 

usefulness of these varieties is limited exclusively to edible food. If a farmer wanted to forego 

harvesting his sweet corn for use as edible food during seed development , the crop would be 

essentially a loss. The grain yield and quality of sweet corn is poor for two fundamental reasons. 

The first reason is that mutations in the starch biosynthesis pathway cripple the starch 

biosynthetic machinery and the grains do not fill out completely, causing the yield and quality to 

be compromised. Secondly, due to the high levels of sugars present in the grain and the inability 

to sequester these sugars as starch, the overall sink strength of the seed is reduced, which 

exacerbates the reduction of nutrient storage in the grain. The endosperms of sweet corn variety 

seeds are shrunken and collapsed, do not undergo proper desiccation, and are susceptible to 

diseases. The poor quality of the sweet corn grain has further agronomic implications; as poor 
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seed viability, poor germination, seedling disease susceptibility, and poor early seedling vigor 
result from the combination of factors caused by inadequate starch accumulation. Thus, the poor 
quality issues of sweet corn impact the consumer, farmer/grower, distributor, and seed producer. 

Thus, for dry-milling, there is a need for a method which improves the efficiency of the 
process and/or increases the value of the co-products. For wet-milling, there is a need for a 
method of processing starch that does not require the equipment necessary for prolonged 
steeping, grinding, milling, and/or separating the components of the kernel. For example, there 
is a need to modify or eliminate the steeping step in wet milling as this would reduce the amount 
of waste water requiring disposal, thereby saving energy and time, and increasing mill capacity 
(kernels would spend less time in steep tanks). There is also a need to eliminate or improve the 
process of separating the starch-containing endosperm from the embryo. 

Summary of the Invention 
The present invention is directed to self-processing plants and plant parts and methods of 
using the same. The self-processing plant and plant parts of the present invention are capable of 
expressing and activating enzyme(s) (mesophilic, thermophilic, and/or hyperthermophilic). 
Upon activation of the enzyme(s) (mesophilic, thermophilic, or hyperthermophilic) the plant or 
plant part is capable of self-processing the substrate upon which it acts to obtain the desired 
result. 

The present invention is directed to an isolated polynucleotide a) comprising SEQ ID 
NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or the complement thereof, or a 
polynucleotide which hybridizes to the complement of any one of SEQ ED NO: 2, 4, 6, 9, 19, 21, 
25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 under low stringency hybridization conditions and 
encodes a polypeptide having a -amylase, pullulanase, a-glucosidase, glucose isomerase, or 
glucoamylase activity or b) encoding a polypeptide comprising SEQ ID NO: 10, 13, 14, 15, 16, 
18, 20 24, 26, 27, 28, 29, 30, 33, 34, 35, 36, 38, 40, 42, 44, 45, 47, 49, or 51 or an enzymatically 
active fragment thereof. Preferably, the isolated polynucleotide encodes a fusion polypeptide 
comprising a first polypeptide and a second peptide, wherein said first polypeptide has a - 
amylase, pullulanase, a -glucosidase, glucose isomerase, or glucoamylase activity. Most 

preferably, the second peptide comprises a signal sequence peptide, which may target the first 
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polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of 
a plant. For example, the signal sequence may be an N-terminal signal sequence from waxy, an 
N-terminal signal sequence from y-zein, a starch binding domain, or a C-terminal starch binding 
domain. Polynucleotides that hybridize to the complement of any one of SEQ ED NO: 2, 9, or 52 
under low stringency hybridization conditions and encodes a polypeptide having a -amylase 
activity; to the complement of SEQ ID NO: 4 or 25 under low stringency hybridization 
conditions and encodes a polypeptide having pullulanase activity; to the complement of SEQ ID 
NO:6 and encodes a polypeptide having a -glucosidase activity; to the complement of of any one 
of SEQ ID NO: 19, 21, 37, 39, 41, or 43 under low stringency hybridization conditions and 
encodes a polypeptide having glucose isomerase activity; to the complement of any one of SEQ 
ID NO: 46, 48, 50, or 59 under low stringency hybridization conditions and encodes a 
polypeptide having glucoamylase activity are further encompassed. 

The present invention is also directed to an isolated polynucleotide a) comprising SEQ ID 
NO: 61, 63, 65, 79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or the complement 
thereof, or a polynucleotide which hybridizes to the complement of any one of SEQ ED NO: 61, 

63, 65, 79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, or 1 10 under low stringency 
hybridization conditions and encodes a polypeptide having xylanase, cellulase, glucanase, beta 
glucosidase, esterase or phytase activity b) encoding a polypeptide comprising SEQ ED NO: 62, 

64, 66, 70, 80, 82, 84, 86, 88, 90, 92, 109, or 1 1 1 or an enzymatically active fragment thereof. 
The isolated polynucleotide may encode a fusion polypeptide comprising a first polypeptide and 
a second peptide, wherein said first polypeptide has xylanase, cellulase, glucanase, beta 
glucosidase, protease, or phytase activity. The second peptide may comprises a signal sequence 
peptide, which may target the first polypeptide to a vacuole, endoplasmic reticulum, chloroplast, 
starch granule, seed or cell wall of a plant. For example, the signal sequence may be an N- 
terminal signal sequence from waxy, an N-terminal signal sequence from y-zein, a starch binding 
domain, or a C-terminal starch binding domain. 

Exemplary xylanases provided and useful in the invention include those encoded by SEQ 
ED NO: 61, 63, or 65. An exemplary protease, namely bromelain, encoded by SEQ ID NO: 69 is 
also provided. Exemplary cellulases include cellobiohydrolase I and II as provided herein and 
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encoded by SEQ ID NO: 79,81,93, and 94. An exemplary glucanase is provides as 6GP1 
described herein encoded by SEQ ID NO: 85. Exemplary beta glucosidases include beta 
glucosidase 2 and D, as described herein and encoded by SEQ ID NO: 96 and 97. An exemplary 
esterase is also provided, namely ferulic acid esterase as encoded by SEQ ED NO:99. And, an 
exemplary phytase, Nov9X as encoded by SEQ ID NO: 109-1 12 is also provided. 

Also included are expression cassettes comprising a polynucleotide a) having SEQ ID 
NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or the complement thereof, or a 
polynucleotide which hybridizes to the complement of any one of SEQ ID NO: 2, 4, 6, 9, 19, 21, 
25, 37, 39, 41, 43, 46, 48, 50, 52, or 59 or under low stringency hybridization conditions and 
encodes an polypeptide having a -amylase, pullulanase, a -glucosidase, glucose isomerase, or 
glucoamylase activity or b) encoding a polypeptide comprising SEQ ED NO: 10, 13, 14, 15, 16, 
18, 20, 24, 26, 27, 28, 29, 30, 33, 34, 35, 36, 38, 40, 42, 44, 45, 47, 49, or 51, or an enzymatically 
active fragment thereof. The expression cassette further comprises a promoter operably linked to 
the polynucleotide, such as an inducible promoter, tissue-specific promoter, or preferably an 
endosperm-specific promoter. Preferably, the endosperm-specific promoter is a maize y-zein 
promoter or a maize ADP-gpp promoter or a maize Q promoter promoter or a rice glutelin-1 
promoter. In a preferred embodiment, the promoter comprises SEQ ED NO: 1 1 or SEQ ED NO: 
12 or SEQ ED NO: 67 or SEQ ID NO: 98. Moreover, in another preferred embodiment the 
polynucleotide is oriented in sense orientation relative to the promoter. The expression cassette 
of the present invention may further encode a signal sequence which is operably linked to the 
polypeptide encoded by the polynucleotide. The signal sequence preferably targets the operably 
linked polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell 
wall of a plant. The signal sequences include an N-terminal signal sequence from waxy, an N- 
terminal signal sequence from y-zein, or a starch binding domain. 

Moreover, an expression cassette comprising a polynucleotide a) having SEQ ID NO: 61, 
63, 65, 79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or the complement thereof, 
or a polynucleotide which hybridizes to the complement of any one of SEQ ED NO: 61, 63, 65, 
79, 81, 83, 85, 87, 89, 91, 93, 94, 95, 96, 97, 99, 108, and 1 10 or under low stringency 
hybridization conditions and encodes an polypeptide having xylanase, cellulase, glucanase, beta 
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glucosidase, esterase or phytase activity or b) encoding a polypeptide comprising SEQ ED NO: 
62, 64, 66, 70, 80, 82, 84, 86, 88, 90, 92, 109, or 1 1 1, or an enzymatically active fragment 
thereof. The expression cassette further comprises a promoter operably linked to the 
polynucleotide, such as an inducible promoter, tissue-specific promoter, or preferably an 
endosperm-specific promoter. The endosperm-specific promoter may be a maize y-zein 
promoter or a maize ADP-gpp promoter or a maize Q promoter promoter or a rice glutelin-1 
promoter. In an embodiment, the promoter comprises SEQ ID NO: 1 1 or SEQ ID NO: 12 or 
SEQ ID NO: 67 or SEQ ID NO: 98. Moreover, in another embodiment the polynucleotide is 
oriented in sense orientation relative to the promoter. The expression cassette of the present 
invention may further encode a signal sequence which is operably linked to the polypeptide 
encoded by the polynucleotide. The signal sequence preferably targets the operably linked 
polypeptide to a vacuole, endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of 
a plant. The signal sequences include an N-terminal signal sequence from waxy, an N-terminal 
signal sequence from y-zein, or a starch binding domain. 

The present invention is further directed to a vector or cell comprising the expression 
cassettes of the present invention. The cell may be selected from the group consisting of an 
Agrobacterium, a monocot cell, a dicot cell, a Liliopsida cell, a Panicoideae cell, a maize cell, 

and a cereal cell, such as a rice cell. 

Moreover, the present invention encompasses a plant stably transformed with the vectors 
of the present invention. A plant stably transformed with a vector comprising an a-amylase 
having an amino acid sequence of any of SEQ ID NO: 1, 10, 13, 14, 15, 16, 33, 35 or 88 or 
encoded by a polynucleotide comprising any of SEQ ID NO: 2, 9, or 87 is provided. 

In another embodiment, a plant stably transformed with a vector comprising a pullulanase 
having an amino acid sequence of any of SEQ ID NO: 24 or 34, or encoded by a polynucleotide 
comprising any of SEQ ID NO: 4 or 25 is provided. A plant stably transformed with a vector 
comprising an a-glucosidase having an amino acid sequence of any of SEQ ID NO: 26 or 27, or 
encoded by a polynucleotide comprising SEQ ID NO:6 is further provided. A plant stably 
transformed with a vector comprising an glucose isomerase having an amino acid sequence of 
any of SEQ ID NO: 18, 20, 28, 29, 30, 38, 40, 42, or 44, or encoded by a polynucleotide 
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comprising any of SEQ ID NO: 19, 21, 37, 39, 41, or 43 is further described herein. In another 
embodiment, a plant stably transformed with a vector comprising a glucose amylase having an 
amino acid sequence of any of SEQ ED NO: 45, 47, or 49, or encoded by a polynucleotide 
comprising any of SEQ ID NO:46, 48, 50, or 59 is described. 

An additional embodiment provides a plant stably transformed with a vector comprising a 
xylanase having an amino acid sequence of any of SEQ ID NO: 62, 64 or 66, or encoded by a 
polynucleotide comprising any of SEQ ID NO: 61, 63, or 65. A plant stably transformed with a 
vector comprising a protease is also provided. The protease may be bromelain having an amino 
acid sequence as set forth in SEQ ID NO: 70, or encoded by a polynucleotide having SEQ ID 
NO: 69. In another embodiment, a plant stably transformed with a vector comprising a cellulase 
is provided. The cellulase may be a cellobiohydrolase encoded by a polynucleotide comprising 
any of SEQ ID NO: 79, 80, 81, 82, 93 or 94. 

An additional embodiment provides a plant stably transformed with a vector comprising a 
glucanase, such as an endoglucanase. The endoglucanase may be endoglucanase I which has an 
amino acid sequence as in SEQ ID NO: 84, or encoded by a polynucleotide comprising SEQ ED 
NO: 83. A plant stably transformed with a vector comprising a beta glucosidase is also provided. 
The beta glucosidase is may be beta glucosidase 2 or beta glucosidase D, which have an amino 
acid sequence set forth in SEQ ID NO: 90 or 92, or encoded by a polynucleotide having SEQ ID 
NO: 89 or 91 . In another embodiment, a plant stably transformed with a vector comprising an 
esterase is provided. The esterase may be a ferulic acid esterase encoded by a polynucleotide 

comprising SEQ ID NO: 99. 

Plant products, such as seed, fruit or grain from the stably transformed plants of the 

present invention are further provided. 

In another embodiment, the invention is directed to a transformed plant, the genome of 
which is augmented with a recombinant polynucleotide encoding at least one processing enzyme 
operably linked to a promoter sequence, the sequence of which polynucleotide is optimized for 
expression in the plant. The plant may be a monocot, such as maize or rice, or a dicot. The 
plant may be a cereal plant or a commercially grown plant. The processing enzyme is selected 
from the group consisting of an a-amylase, glucoamylase, glucose isomerase, glucanase, 0- 
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amylase, a-glucosidase, isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
amylopullulanase, cellulase, exo-l,4-p-cellobiohydrolase, exo-l,3-p-D-glucanase, p-glucosidase, 
endoglucanase, L-arabinase, oarabinosidase, galactanase, galactosidase, mannanase, 
mannosidase, xylanase, xylosidase, protease, glucanase, xylanase, , esterase, phytase, and lipase. 
The processing enzyme is a starch-processing enzyme selected from the group consisting of ar- 
amylase, glucoamylase, glucose isomerase, P-amylase, a-glucosidase, isoamylase, pullulanase, 
neo-pullulanase, iso-pullulanase, and amylopullulanase. The enzyme may be selected from a- 
amylase, glucoamylase, glucose isomerase, glucose isomerase, a-glucosidase, and pullulanase. 
The processing enzyme may be hyperthermophilic. In accordance with this aspect of the 
invention, the enzyme may be a non-starch degrading enzyme selected from the group consisting 
of protease, glucanase, xylanase, esterase, phytase, cellulase, beta glucosidase, and lipase. Such 
enzymes may be hyperthermophilic. In an embodiment, the enzyme accumulates in the vacuole, 
endoplasmic reticulum, chloroplast, starch granule, seed or cell wall of a plant. Moreover, in 
another embodiment, the genome of plant may be further augmented with a second recombinant 
polynucleotide comprising a non-hyperthermophilic enzyme. 

In another aspect of the invention, provided is a transformed plant, the genome of which 
is augmented with a recombinant polynucleotide encoding at least one processing enzyme 
selected from the group consisting of a-amylase, glucoamylase, glucose isomerase, a- 
glucosidase, pullulanase, xylanase, cellulase, protease, glucanase, beta glucosidase, esterase, 
phytase or lipase operably linked to a promoter sequence, the sequence of which polynucleotide 
is optimized for expression in the plant. 

Another embodiment is directed to a transformed maize plant, the genome of which is 
augmented with a recombinant polynucleotide encoding at least one processing enzyme selected 
from the group consisting of a-amylase, glucoamylase, glucose isomerase, a-glucosidase, 
pullulanase, xylanase, cellulase, protease, glucanase, phytase, beta glucosidase, esterase, or 
lipase operably linked to a promoter sequence, the sequence of which polynucleotide is 
optimized for expression in the maize plant. 

A transformed plant, the genome of which is augmented with a recombinant 
polynucleotide having SEQ ID NO: 83 operably linked to a promoter and to a signal sequence is 
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provided. Additionally, a transformed plant, the genome of which is augmented with a 
recombinant polynucleotide having the SEQ ID NO: 93 or 94 operably linked to a promoter and 
to a signal sequence is described. In another embodiment, a transformed plant, the genome of 
which is augmented with a recombinant polynucleotide having SEQ ID NO: 95, operably linked 
to a promoter and to a signal sequence. Moreover, a transformed plant, the genome of which is 
augmented with a recombinant polynucleotide having SEQ ID NO: 96 is described. Also 
described is a transformed plant, the genome of which is augmented with a recombinant 
polynucleotide having SEQ ID NO: 97. Also described is a transformed plant, the genome of 
which is augmented with a recombinant polypeptide having SEQ ID NO: 99. 

Products of the transformed plants are further envisioned herein. The product for 
example, include seed, fruit, or grain. The product may alternatively be the processing enzyme, 
starch or sugar. 

A plant obtained from a stably transformed plant of the present invention is further 
described. In this aspect, the plant may be a hybrid plant or an inbred plant. 

A starch composition is a further embodiment of the invention comprising at least one 
processing enzyme which is a protease, glucanase, or esterase. 

Grain is another embodiment of the invention comprising at least one processing enzyme, 
which is an a-amylase, pullulanase, a -glucosidase, glucoamylase, glucose isomerase, xylanase, 
cellulase, glucanase, beta glucosidase, esterase, protease, lipase or phytase. 

In another embodiment, a method of preparing starch granules, comprising 
treating grain which comprises at least one non-starch processing enzyme under conditions 
which activate the at least one enzyme, yielding a mixture comprising starch granules and non- 
starch degradation products, wherein the grain is obtained from a transformed plant, the genome 
of which is augmented with an expression cassette encoding the at least one enzyme; and 
separating starch granules from the mixture is provided. Therein, the enzyme may be a protease, 
glucanase, xylanase, phytase, lipase, beta glucosidase, cellulase or esterase. Moreover, the 
enzyme is preferably hyperthermophilic. The grain may be cracked grain and/or may be treated 
under low or high moisture conditions. Alternativley, the grain may treated with sulfur dioxide. 
The present invention may further comprise separating non-starch products from the mixture. 

The starch products and non-starch products obtained by this method are further described. 
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In yet another embodiment, a method to produce hypersweet corn comprising treating 
transformed corn or a part thereof, the genome of which is augmented with and expresses in the 
endosperm an expression cassette encoding at least one starch-degrading or starch-isomerizing 
enzyme, under conditions which activate the at least one enzyme so as to convert 
polysaccharides in the corn into sugar, yielding hypersweet corn is provided. The expression 
cassette may further comprises a promoter operably linked to the polynucleotide encoding the 
enzyme. The promoter may be a constitutive promoter, seed-specific promoter, or endosperm- 
specific promoter, for example. The enzyme may be hyperthermophilic and may be an a- 
amylase. The expression cassette used herein may further comprise a polynucleotide which 
encodes a signal sequence operably linked to the at least one enzyme. The signal sequence may 
direct the enzyme to the apoplast or the endoplasmic reticulum, for example. The enzyme 
comprises any one of SEQ ID NO: 13, 14, 15, 16, 33, or 35. The enzyme may also comprise 
SEQ ID NO: 87. 

In a most preferred embodiment, a method of producing hypersweet corn comprising 
treating transformed corn or a part thereof, the genome of which is augmented with and 
expresses in the endosperm an expression cassette encoding an a-amylase, under conditions 
which activate the at least one enzyme so as to convert polysaccharides in the corn into sugar, 
yielding hypersweet corn is described. The enzyme may be hyperthermophilic and the 
hyperthermophilic a-amylase may comprise the amino acid sequence of any of SEQ ID NO: 10, 
13, 14, 15, 16, 33, or 35, or an enzymatically active fragment thereof having a-amylase activity. 
The enzyme comprise SEQ ED NO: 87. 

A method to prepare a solution of hydrolyzed starch product comprising; 
treating a plant part comprising starch granules and at least one processing enzyme under 
conditions which activate the at least one enzyme thereby processing the starch granules to form 
an aqueous solution comprising hydrolyzed starch product, wherein the plant part is obtained 
from a transformed plant, the genome of which is augmented with an expression cassette 
encoding the at least one starch processing enzyme; and 

collecting the aqueous solution comprising the hydrolyzed starch product is described herein. 
The hydrolyzed starch product may comprise a dextrin, maltooligosaccharide, glucose and/or 
mixtures thereof. The enzyme may be a-amylase, a-glucosidase, glucoamylase, pullulanase, 
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amylopullulanase, glucose isomerase, or any combination thereof. Moreover, the enzyme may 
be hyperthermophilic. In another aspect, the genome of the plant part may be further augmented 
with an expression cassette encoding a non-hyperthermophilic starch processing enzyme. The 
non-hyperthermophilic starch processing enzyme may be selected from the group consisting of 
amylase, glucoamylase, a-glucosidase, pullulanase, glucose isomerase, or a combination thereof. 
In yet another aspect, the processing enzyme is preferably expressed in the endosperm. The 
plant part may be grain, and from corn, wheat, barley, rye, oat, sugar cane or rice. The at least 
one processing enzyme is operably linked to a promoter and to a signal sequence that targets the 
enzyme to the starch granule or the endoplasmic reticulum, or to the cell wall. The method may 
further comprise isolating the hydrolyzed starch product and/or fermenting the hydrolyzed starch 
product. 

In another aspect of the invention, a method of preparing hydrolyzed starch product 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising a hydrolyzed starch product, wherein the plant 
part is obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding at least one ar-amylase; and 

collecting the aqueous solution comprising hydrolyzed starch product is described. The a- 
amylase may be hyperthermophilic and the hyperthermophilic a-amylase comprises the amino 
acid sequence of any of SEQ ID NO: 1, 10, 13, 14, 15, 16, 33, or 35, or an active fragment 
thereof having a-amylase activity. The expression cassette may comprise a polynucleotide 
selected from any of SEQ ID NO: 2, 9, 46, or 52, a complement thereof, or a polynucleotide that 
hybridizes to any of SEQ ID NO: 2, 9, 46, or 52 under low stringency hybridization conditions 
and encodes a polypeptide having a-amylase activity. Moreover, the invention further provides 
for the genome of the transformed plant further comprising a polynucleotide encoding a non- 
thermophilic starch-processing enzyme. Alternatively, the plant part may be treated with a non- 
hyperthermophilic starch-processing enzyme. 

The present invention is further directed to a transformed plant part comprising at least 
one starch-processing enzyme present in the cells of the plant, wherein the plant part is obtained 

from a transformed plant, the genome of which is augmented with an expression cassette 
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encoding the at least one starch processing enzyme. Preferably, the enzyme is a starch- 
processing enzyme selected from the group consisting of a-amylase, glucoamylase, glucose 
isomerase, p-amylase, a-glucosidase, isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
and amylopullulanase. Moreover, the enzyme may be hyperthermophilic. The plant may be any 
plant, such as corn or rice for example. 

Another embodiment of the invention is a transformed plant part comprising at least one 
non-starch processing enzyme present in the cell wall or the cells of the plant, wherein the plant 
part is obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one non-starch processing enzyme or at least one non-starch 
polysaccharide processing enzyme. The enzyme may be hyperthermophilic. Moreover, the non- 
starch processing enzyme may be a protease, glucanase, xylanase, esterase, phytase, beta 
glucosidase, cellulase or lipase. The plant part can be any plant part, but preferably is an ear, 
seed, fruit, grain, stover, chaff, or bagasse. 

The present invention is also directed to transformed plant parts. For example, a 
transformed plant part comprising an a-amylase having an amino acid sequence of any of SEQ 
ID NO: 1, 10, 13, 14, 15, 16, 33, or 35, or encoded by a polynucleotide comprising any of SEQ 
ED NO: 2, 9, 46, or 52, a transformed plant part comprising an a-glucosidase having an amino 
acid sequence of any of SEQ ID NO: 5, 26 or 27, or encoded by a polynucleotide comprising 
SEQ ID NO:6, a transformed plant part comprising a glucose isomerase having the amino acid 
sequence of any one of SEQ ID NO: 28, 29, 30, 38, 40, 42, or 44, or encoded by a 
polynucleotide comprising any one of SEQ ID NO: 19, 21, 37, 39, 41, or 43, a transformed plant 
part comprising a glucoamylase having the amino acid sequence of SEQ ID NO:45 or SEQ ED 
NO:47, or SEQ ID NO:49, or encoded by a 

polynucleotide comprising any of SEQ ED NO: 46, 48, 50, or 59, and a transformed plant part 
comprising a pullulanase encoded by a polynucleotide comprising any of SEQ ID NO: 4 or 25 
are described. 

The present invention is also directed to transformed plant parts. For example, a 
transformed plant part comprising a xylanase having an amino acid sequence of any of SEQ ED 
NO: 62, 64 or 66, or encoded by a polynucleotide comprising any of SEQ ID NO: 61, 63, or 65. 
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A transformed plant part comprising a protease is also provided. The protease may be bromelain 
having an amino acid sequence as set forth in SEQ ID NO: 70, or encoded by a polynucleotide 
having SEQ ID NO: 69. In another embodiment, a transformed plant part comprising a cellulase 
is provided. The cellulase may be a cellobiohydrolase encoded by a polynucleotide comprising 
any of SEQ ID NO: 79, 80, 81, 82, 93 or 94. 

An additional embodiment provides a transformed plant part a glucanase, such as an 
endoglucanase. The endoglucanase may be endoglucanase I which has an amino acid sequence 
as in SEQ ID NO: 84, or encoded by a polynucleotide comprising SEQ ED NO: 83. A 
transformed plant part comprising a beta glucosidase is also provided. The beta glucosidase is 
may be beta glucosidase 2 or beta glucosidase D, which have an amino acid sequence set forth in 
SEQ ID NO: 90 or 92, or encoded by a polynucleotide having SEQ ID NO: 89 or 91 . In another 
embodiment, a transformed plant part comprising an esterase is provided. The esterase may be a 
ferulic acid esterase encoded by a polynucleotide comprising SEQ ED NO: 99. 

Another embodiment is a method of converting starch in the transformed plant part 
comprising activating the starch processing enzyme contained therein. The starch, dextrin, 
maltooligosaccharide or sugar produced according to this method is further described. 

The present invention further describes a method of using a transformed plant part 
comprising at least one non-starch processing enzyme in the cell wall or the cell of the plant part, 
comprising treating a transformed plant part comprising at least one non-starch polysaccharide 
processing enzyme under conditions so as to activate the at least one enzyme thereby digesting 
non-starch polysaccharide to form an aqueous solution comprising oligosaccharide and/or 
sugars, wherein the plant part is obtained from a transformed plant, the genome of which is 
augmented with an expression cassette encoding the at least one non-starch polysaccharide 
processing enzyme; and collecting the aqueous solution comprising the oligosaccharides and/or 
sugars. The non-starch polysaccharide processing enzyme may be hyperthermophilic. 

A method of using transformed seeds comprising at least one processing enzyme, 

comprising treating transformed seeds which comprise at least one protease or lipase under 

conditions so as the activate the at least one enzyme yielding an aqueous mixture comprising 

amino acids and fatty acids, wherein the seed is obtained from a transformed plant, the genome 

of which is augmented with an expression cassette encoding the at least one enzyme; and 
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collecting the aqueous mixture. The amino acids, fatty acids or both are preferably isolated. The 
at least one protease or lipase may be hyperthermophilic. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
polysaccharide processing enzyme under conditions to activate the at least one enzyme thereby 
digesting polysaccharide to form oligosaccharide or fermentable sugar, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one polysaccharide processing enzyme; and incubating the 
fermentable sugar under conditions that promote the conversion of the fermentable sugar or 
oligosaccharide into ethanol. The 

plant part may be a grain, fruit, seed, stalks, wood, vegetable or root. The plant part may be 
obtained from a plant selected from the group consisting of oats, barley, wheat, berry, grapes, 
rye, corn, rice, potato, sugar beet, sugar cane, pineapple, grasses and trees. 
In another preferred embodiment, the polysaccharide processing enzyme is a-amylase, 
glucoamylase, a-glucosidase, glucose isomerase, pullulanase, or a combination thereof. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
enzyme selected from the group consisting of a-amylase, glucoamylase, a-glucosidase, glucose 
isomerase, or pullulanase, or a combination thereof, with heat for an amount of time and under 
conditions to activate the at least one enzyme thereby digesting polysaccharide to form 
fermentable sugar, wherein the plant part is obtained from a transformed plant, the genome of 
which is augmented with an expression cassette encoding the at least one enzyme; and 
incubating the fermentable sugar under conditions that promote the conversion of the 
fermentable sugar into ethanol is provided. The at least one enzyme may be hyperthermophilic 
or mesophilic. 

In another embodiment, a method to prepare ethanol comprising treating a plant part 
comprising at least one non-starch processing enzyme under conditions to activate the at least 
one enzyme thereby digesting non-starch polysaccharide to oligosaccharide and fermentable 
sugar, wherein the plant part is obtained from a transformed plant, the genome of which is 
augmented with an expression cassette encoding the at least one enzyme; and incubating the 
fermentable sugar under conditions that promote the conversion of the fermentable sugar into 
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ethanol is provided. The non-starch processing enzyme may be a xylanase, cellulase, glucanase, 
beta glucosidase, protease, esterase, lipase or phytase. 

A method to prepare ethanol comprising treating a plant part comprising at least one 
enzyme selected from the group consisting of a-amylase, glucoamylase, a-glucosidase, glucose 
isomerase, or pullulanase, or a combination thereof, under conditions to activate the at least one 
enzyme thereby digesting polysaccharide to form fermentable sugar, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one enzyme; and incubating the fermentable sugar under 
conditions that promote the conversion of the fermentable sugar into ethanol is further provided. 
The enzyme may be hyperthermophilic. 

Moreover, a method to produce a sweetened farinaceous food product without adding 
additional sweetener comprising treating a plant part comprising at least one starch processing 
enzyme under conditions which activate the at least one enzyme, thereby processing starch 
granules in the plant part to sugars so as to form a sweetened product, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one enzyme; and processing the sweetened product into a 
farinaceous food product is described. The farinaceous food product may be formed from the 
sweetened product and water. Moreover, the farinaceous food product may contain malt, 
flavorings, vitamins, minerals, coloring agents or any combination thereof. The at least one 
enzyme may be hyperthermophilic. The enzyme may be selected from a-amylase, oc- 
glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
plant may further be selected from the group consisting of soybean, rye, oats, barley, wheat, 
corn, rice and sugar cane. The farinaceous food product may be a cereal food, a breakfast food, a 
ready to eat food, or a baked food. The processing may include baking, boiling, heating, 
steaming, electrical discharge or any combination thereof. 

The present invention is further directed to a method to sweeten a starch-containing 
product without adding sweetener comprising treating starch comprising at least one starch 
processing enzyme under conditions to activate the at least one enzyme thereby digesting the 
starch to form a sugar to form sweetened starch, wherein the starch is obtained from a 
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transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one enzyme; and adding the sweetened starch to a product to produce a sweetened starch 
containing product. The transformed plant may be selected from the group consisting of corn, 
soybean, rye, oats, barley, wheat, rice and sugar cane. The at least one enzyme may be 
hyperthermophilic. The at least one enzyme may be a-amylase, ot-glucosidase, glucoamylase, 
pullulanase, glucose isomerase, or any combination thereof. 

A farinaceous food product and sweetened starch-containing product is provided for 

herein. 

The invention is also directed to a method to sweeten a polysaccharide-containing fruit or 
vegetable comprising treating a fruit or vegetable comprising at least one polysaccharide 
processing enzyme under conditions which activate the at least one enzyme, thereby processing 
the polysaccharide in the fruit or vegetable to form sugar, yielding a sweetened fruit or 
vegetable, wherein the fruit or vegetable is obtained from a transformed plant, the genome of 
which is augmented with an expression cassette encoding the at least one polysaccharide 
processing enzyme. The fruit or vegetable is selected from the group consisting of potato, 
tomato, banana, squash, peas, and beans. The at least one enzyme may be hyperthermophilic. 

The present invention is further directed to a method of preparing an aqueous solution 
comprising sugar comprising treating starch granules obtained from the plant part under 
conditions which activate the at least one enzyme, thereby yielding an aqueous solution 
comprising sugar. 

Another embodiment is directed to a method of preparing starch derived products from 
grain that does not involve wet or dry milling grain prior to recovery of starch-derived products 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising dextrins or sugars, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one starch processing enzyme; and collecting the aqueous solution 
comprising the starch derived product. The at least one starch processing enzyme may be 
hyperthermophilic. 
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A method of isolating an a-amylase, glucoamylase, glucose isomerase, a-glucosidase, 
and pullulanase comprising culturing a transformed plant containing the a-amylase, 
glucoamylase, glucose isomerase, a-glucosidase, or pullulanase and isolating the a-amylase, 
glucoamylase, glucose isomerase, a-glucosidase or pullulanase therefrom is further provided. 
Also provided is a method of isolating a xylanase, cellulase, glucanase, beta glucosidase, 
protease, esterase, phytase or lipase comprising culturing a transformed plant containing the 
xylanase, cellulase, glucanase, beta glucosidase, protease, esterase, phytase or lipase and 
isolating the xylanase, cellulase, glucanase, esterase, beta glucosidase, protease, esterase, phytase 
or lipase . 

A method of preparing maltodextrin comprising mixing transgenic grain with water, 
heating said mixture, separating solid from the dextrin syrup generated, and 
collecting the maltodextrin. The transgenic grain comprises at least one starch processing 
enzyme. The starch processing enzyme may be a-amylase, glucoamylase, a-glucosidase, and 
glucose isomerase. Moreover, maltodextrin produced by the method is provided as well as 
composition produced by this method. 

A method of preparing dextrins, or sugars from grain that does not involve mechanical 
disruption of the grain prior to recovery of starch-derived comprising: 

treating a plant part comprising starch granules and at least one starch processing enzyme under 
conditions which activate the at least one enzyme thereby processing the starch granules to form 
an aqueous solution comprising dextrins or sugars, wherein the plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one processing enzyme; and 

collecting the aqueous solution comprising sugar and/or dextrins is provided. 

The present invention is further directed to a method of producing fermentable sugar 
comprising treating a plant part comprising starch granules and at least one starch processing 
enzyme under conditions which activate the at least one enzyme thereby processing the starch 
granules to form an aqueous solution comprising dextrins or sugars, wherein the plant part is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one processing enzyme; and 
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collecting the aqueous solution comprising the fermentable sugar. 

Moreover, a maize plant stably transformed with a vector comprising a hyperthermophlic 
a-amylase is provided herein. For example, preferably, a maize plant stably transformed with a 
vector comprising a polynucleotide sequence that encodes oc-amylase that is greater than 60% 
identical to SEQ ID NO: 1 or SEQ ED NO: 51 is encompassed. 

Brief Description of the Fieures 

Figures 1 A and IB illustrate the activity of ot-amylase expressed in corn kernels and in 
the endosperm from segregating Tl kernels from pNOV6201 plants and from six pNOV6200 
lines. 

Figure 2 illustrates the activity of ct-amylase in segregating Tl kernels from pNOV6201 

lines. 

Figure 3 depicts the amount of ethanol produced upon fermentation of mashes of 
transgenic corn containing thermostable 797GL3 alpha amylase that were subjected to 
liquefaction times of up to 60 minutes at 85°C and 95°C. This figure illustrates that the ethanol 
yield at 72 hours of fermentation was almost unchanged from 15 minutes to 60 minutes of 
liquefaction. Moreover, it shows that mash produced by liquefaction at 95°C produced more 
ethanol at each time point than mash produced by liquefaction at 85°C 

Figure 4 depicts the amount of residual starch (%) remaining after fermentation of 
mashes of transgenic corn containing thermostable alpha amylase that were subjected to a 
liquefaction time of up to 60 minutes at 85°C and 95°C. This figure illustrates that the ethanol 
yield at 72 hours of fermentation was almost unchanged from 15 minutes to 60 minutes of 
liquefaction. Moreover, it shows that mash produced by liquefaction at 95°C produced more 
ethanol at each time point than mash produced by liquefaction at 85°C. 

Figure 5 depicts the ethanol yields for mashes of a transgenic corn, control corn, and 
various mixtures thereof prepared at 85°C and 95°C. This figure illustrates that the transgenic 
com comprising a-amylase results in significant improvement in making starch available for 
fermentation since there was a reduction of starch left over after fermentation. 
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Figure 6 depicts the amount of residual starch measured in dried stillage following 
fermentation for mashes of a transgenic grain, control corn, and various mixtures thereof at 
prepared at 85°C and 95°C. 

Figure 7 depicts the ethanol yields as a function of fermentation time of a sample 
comprising 3% transgenic corn over a period of 20-80 hours at various pH ranges from 5.2-6.4. 
The figure illustrates that the fermentation conducted at a lower pH proceeds faster than at a pH 
of 6.0 or higher. 

Figure 8 depicts the ethanol yields during fermentation of a mash comprising various 
weight percentages of transgenic corn from 0-12 wt% at various pH ranges from 5.2-6.4. This 
figure illustrates that the ethanol yield was independent of the amount of transgenic grain 
included in the sample. 

Figure 9 shows the analysis of T2 seeds from different events transformed with pNOV 
7005. High expression of pullulanase activity, compared to the non-transgenic control, can be 

detected in a number of events. 

Figure 10A and 10B show the results of the HPLC analysis of the hydrolytic products 
generated by expressed pullulanase from starch in the transgenic corn flour. Incubation of the 
flour of pullulanase expressing corn in reaction buffer at 75 °C for 30 minutes results in 
production of medium chain oligosaccharides (degree of polymerization (DP) -10-30) and short 
amylose chains (DP ~ 100 -200) from cornstarch. Figures 10A and 10B also show the effect of 
added calcium ions on the activity of the pullulanase. 

Figures 1 1 A and 1 IB depict the data generated from HPLC analysis of the starch 
hydrolysis product from two reaction mixtures. The first reaction indicated as 'Amylase' 
contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase expressing transgenic corn 
and non-transgenic corn A188; and the second reaction mixture 'Amylase + Pullulanase' 
contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase expressing transgenic corn 
and pullulanase expressing transgenic corn. 

Figure 12 depicts the amount of sugar product in fig in 25 /xl of reaction mixture for two 
reaction mixtures. The first reaction indicated as 'Amylase' contains a mixture [1:1 (w/w)] of 
corn flour samples of a -amylase expressing transgenic corn and non-transgenic corn Al 88; and 
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the second reaction mixture 'Amylase + Pullulanase' contains a mixture [1:1 (w/w)] of corn flour 
samples of a -amylase expressing transgenic corn and pullulanase expressing transgenic corn. 

Figure 13A and 13B shows the starch hydrolysis product from two sets of reaction 
mixtures at the end of 30 minutes incubation at 85°C and 95°C. For each set there are two 
reaction mixtures; the first reaction indicated as 4 Amylase X Pullulanase' contains flour from 
transgenic corn (generated by cross pollination) expressing both the a-amylase and the 
pullulanase, and the second reaction indicated as 4 Amylase' mixture of corn flour samples of a - 
amylase expressing transgenic corn and non-transgenic corn A 188 in a ratio so as to obtain same 
amount of a -amylase activity as is observed in the cross (Amylase X Pullulanase). 

Figure 14 depicts the degradation of starch to glucose using non-transgenic corn seed 
(control), transgenic corn seed comprising the 797GL3 a-amylase, and a combination of 797GL3 
transgenic corn seed with Mai A a-glucosidase. 

Figure 15 depicts the conversion of raw starch at room temperature or 30°C. In this 
figure, the reaction mixtures 1 and 2 are a combination of water and starch at room temperature 
and 30°C, respectively. Reaction mixtures 3 and 4 are a combination of barley a-amylase and 
starch at room temperature and at 30°C, respectively. Reaction mixtures 5 and 6 are 
combinations of Thermoanaerobacterium glucoamylase and starch at room temperature and 
30°C, respectively. Reactions mixtures 7 and 8 are combinations of barley a-amylase (sigma) 
and Thermoanaerobacterium glucoamylase and starch at room temperature and 30°C, 
respectively. Reaction mixtures 9 and 10 are combinations of Barley alpha-amylase (sigma) 
control, and starch at room temperature and 30°C, respectively. The degree of polymerization 
(DP) of the products of the Thermoanaerobacterium glucoamylase is indicated. 

Figure 16 depicts the production of fructose from amylase transgenic corn flour using a 
combination of alpha amylase, alpha glucosidase, and glucose isomerase as described in 
Example 19. Amylase corn flour was mixed with enzyme solutions plus water or buffer. All 
reactions contained 60 mg amylase flour and a total of 600jal of liquid and were incubated for 2 
hours at 90°C. 

Figure 17 depicts the peak areas of the products of reaction with 100% amylase flour 
from a self-processing kernel as a function of incubation time from 0-1200 minutes at 90°C. 
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Figure 18 depicts the peak areas of the products of reaction with 10% transgenic amylase 
flour from a self-processing kernel and 90% control corn flour as a function of incubation time 

from 0-1200 minutes at 90°C. 

Figure 19 provides the results of the HPLC analysis of transgenic amylase flour 
incubated at 70°, 80% 90°, or 100° C for up to 90 minutes to assess the effect of temperature on 
starch hydrolysis. 

Figure 20 depicts ELSD peak area for samples containing 60 mg transgenic amylase flour 
mixed with enzyme solutions plus water or buffer under various reaction conditions. One set of 
reactions was buffered with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 
1 mM CoCl 2 ; in a second set of reactions the metal-containing buffer solution was replaced by 
water. All reactions were incubated for 2 hours at 90°C. 

Detailed Description of the Invention 

In accordance with the present invention, a "self-processing" plant or plant part has 
incorporated therein an isolated polynucleotide encoding a processing enzyme capable of 
processing, e.g., modifying, starches, polysaccharides, lipids, proteins, and the like in plants, 
wherein the processing enzyme can be mesophilic, thermophilic or hyperthermophilic, and may 
be activated by grinding, addition of water, heating, or otherwise providing favorable conditions 
for function of the enzyme. The isolated polynucleotide encoding the processing enzyme is 
integrated into a plant or plant part for expression therein. Upon expression and activation of the 
processing enzyme, the plant or plant part of the present invention processes the substrate upon 
which the processing enzyme acts. Therefore, the plant or plant parts of the present invention are 
capable of self-processing the substrate of the enzyme upon activation of the processing enzyme 
contained therein in the absence of or with reduced external sources normally required for 
processing these substrates. As such, the transformed plants, transformed plant cells, and 
transformed plant parts have "built-in" processing capabilities to process desired substrates via 
the enzymes incorporated therein according to this invention. Preferably, the processing 
enzyme-encoding polynucleotide are "genetically stable," i.e., the polynucleotide is stably 
maintained in the transformed plant or plant parts of the present invention and stably inherited by 
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progeny through successive generations. 

In accordance with the present invention, methods which employ such plants and plant 
parts can eliminate the need to mill or otherwise physically disrupt the integrity of plant parts 
prior to recovery of starch-derived products. For example, the invention provides improved 
methods for processing corn and other grain to recover starch-derived products. The invention 
also provides a method which allows for the recovery of starch granules that contain levels of 
starch degrading enzymes, in or on the granules, that are adequate for the hydrolysis of specific 
bonds within the starch without the requirement for adding exogenously produced starch 
hydrolyzing enzymes. The invention also provides improved products from the self-processing 
plant or plant parts obtained by the methods of the invention. 

In addition, the "self-processing" transformed plant part, e.g., grain, and transformed 
plant avoid major problems with existing technology, i.e., processing enzymes are typically 
produced by fermentation of microbes, which requires isolating the enzymes from the culture 
supernatants, which costs money; the isolated enzyme needs to be formulated for the particular 
application, and processes and machinery for adding, mixing and reacting the enzyme with its 
substrate must be developed. The transformed plant of the invention or a part thereof is also a 
source of the processing enzyme itself as well as substrates and products of that enzyme, such as 
sugars, amino acids, fatty acids and starch and non-starch polysaccharides. The plant of the 
invention may also be employed to prepare progeny plants such as hybrids and inbreds. 

Processing Enzymes And Polynucleotides Encoding Them 

A polynucleotide encoding a processing enzyme (mesophilic, thermophilic, or 
hyperthermophilic) is introduced into a plant or plant part. The processing enzyme is selected 
based on the desired substrate upon which it acts as found in plants or transgenic plants and/or 
the desired end product. For example, the processing enzyme may be a starch-processing 
enzyme, such as a starch-degrading or starch-isomerizing enzyme, or a non-starch processing 
enzyme. Suitable processing enzymes include, but are not limited to, starch degrading or 
isomerizing enzymes including, for example, oc-amylase, endo or exo-1,4, or 1,6-oc-D, 
glucoamylase, glucose isomerase, p-amylases, a-glucosidases, and other exo-amylases; and 
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starch debranching enzymes, such as isoamylase, pullulanase, neo-pullulanase, iso-pullulanase, 
amylopullulanase and the like, glycosyl transferases such as cyclodextrin glycosyltransferase and 
the like, cellulases such as exo-l,4-p-cellobiohydrolase, exo-l,3-P-D-glucanase, hemicellulase, 
P-glucosidase and the like; endoglucanases such as endo-l,3-P-glucanase and endo-l,4-P- 
glucanase and the like; L-arabinases, such as endo-l,5-a-L-arabinase, a-arabinosidases and the 
like; galactanases such as endo-l,4-p-D-galactanase, endo-l,3-p-D-galactanase, p-galactosidase, 
a-galactosidase and the like; mannanases, such as endo-l,4-P-D-mannanase, P-mannosidase, a- 
mannosidase and the like; xylanases, such as endo-l,4-p-xylanase, P-D-xylosidase, 1,3-P-D- 
xylanase, and the like; and pectinases; and non-starch processing enzymes, including protease, 
glucanase, xylanase, thioredoxin/thioredoxin reductase, esterase, phytase, and lipase. 

In one embodiment, the processing enzyme is a starch-degrading enzyme selected from 
the group of cx-amylase, pullulanase, ot-glucosidase, glucoamylase, amylopullulanase, glucose 
isomerase, or combinations thereof. According to this embodiment, the starch-degrading 
enzyme is able to allow the self-processing plant or plant part to degrade starch upon activation 
of the enzyme contained in the plant or plant part, as will be further described herein. The 
starch-degrading enzyme(s) is selected based on the desired end-products. For example, a 
glucose-isomerase may be selected to convert the glucose (hexose) into fructose. Alternatively, 
the enzyme may be selected based on the desired starch-derived end product with various chain 
lengths based on, e.g., a function of the extent of processing or with various branching patterns 
desired. For example, an a -amylase, glucoamylase, or amylopullulanase can be used under 
short incubation times to produce dextrin products and under longer incubation times to produce 
shorter chain products or sugars. A pullulanase can be used to. specifically hydrolyze branch 
points in the starch yielding a high-amylose starch, or a neopullulanase can be used to produce 
starch with stretches of a 1,4 linkages with interspersed a 1,6 linkages. Glucosidases could be 
used to produce limit dextrins, or a combination of different enzymes to make other starch 
derivatives. 

In another embodiment, the processing enzyme is a non-starch processing enzyme 
selected from protease, glucanase, xylanase, phytase, lipase, cellulase, beta glucosidase and 
esterase. These non-starch degrading enzymes allow the self-processing plant or plant part of the 
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present invention to incorporate in a targeted area of the plant and, upon activation, disrupt the 
plant while leaving the starch granule therein intact. For example, in a preferred embodiment, 
the non-starch degrading enzymes target the endosperm matrix of the plant cell and, upon 
activation, disrupt the endosperm matrix while leaving the starch granule therein intact and more 
readily recoverable from the resulting material. 

Combinations of processing enzymes are further envisioned by the present invention. 
For example, starch-processing and non-starch processing enzymes may be used in combination. 
Combinations of processing enzymes may be obtained by employing the use of multiple gene 
constructs encoding each of the enzymes. Alternatively, the individual transgenic plants stably 
transformed with the enzymes may be crossed by known methods to obtain a plant containing 
both enzymes. Another method includes the use of exogenous enzyme(s) with the transgenic 
plant. 

The processing enzymes may be isolated or derived from any source and the 
polynucleotides corresponding thereto may be ascertained by one having skill in the art. For 
example, the processing enzyme, such as a-amylase, is derived from the Pyrococcus (e.g., 
Pyrococcus furiosus), Thermus, Thermococcus (e.g., Thermococcus hydrothermalis), Sulfolobus 
(e.g., Sulfolobus solfataricus) Thermotoga (e.g., Thermotoga maritima and Thermotoga 
neapolitand), Thermoanaerobacterium (e.g. Thermoanaerobacter tengcongensis), Aspergillus 
(e.g., Aspergillus shirousami and Aspergillus niger), Rhizopus (eg., Rhizopus oryzae), 
Thermoproteales, Desulfurococcus (e.g. Desulfurococcus amylolyticus), Methanobacterium 
thermoautotrophicum, Methanococcus jannaschii, Methanopyrus kandleri y 
Thermosynechococcus elongatus, Thermoplasma acidophilum, Thermoplasma volcanium, 
Aeropyrum pernix and plants such as corn, barley, and rice. 

The processing enzymes of the present invention are capable of being activated after 
being introduced and expressed in the genome of a plant. Conditions for activating the enzyme 
are determined for each individual enzyme and may include varying conditions such as 
temperature, pH, hydration, presence of metals, activating compounds, inactivating compounds, 
etc. For example, temperature-dependent enzymes may include mesophilic, thermophilic, and 
hyperthermophilic enzymes. Mesophilic enzymes typically have maximal activity at 
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temperatures between 20°- 65 °C and are inactivated at temperatures greater than 70° C. 
Mesophilic enzymes have significant activity at 30 to 37°C, the activity at 30 °C is preferably at 
least 10% of maximal activity, more preferably at least 20% of maximal activity. 

Thermophilic enzymes have a maximal activity at temperatures of between 50 and 80° C 
and are inactivated at temperatures greater than 80°C . A thermophilic enzyme will preferably 
have less than 20% of maximal activity at 30°C, more preferably less than 10% of maximal 
activity. 

A "hyperthermophilic" enzyme has activity at even higher temperatures. 
Hyperthermophilic enzymes have a maximal activity at temperatures greater than 80° C and 
retain activity at temperatures at least 80°C, more preferably retain activity at temperatures of at 
least 90°C and most preferably retain activity at temperatures of at least 95°C. 
Hyperthermophilic enzymes also have reduced activity at low temperatures. A 
hyperthermophilic enzyme may have activity at 30°C that is less than 10% of maximal activity, 
and preferably less than 5% of maximal activity. 

The polynucleotide encoding the processing enzyme is preferably modified to include 
codons that are optimized for expression in a selected organism such as a plant (see, e.g., Wada 
et al., Nucl. Acids Res. , 18:2367 (1990), Murray et at., Nucl. Acids Res. , 17:477 (1989), U.S. 
Patent Nos. 5,096,825, 5,625,136, 5,670,356 and 5,874,304). Codon optimized sequences are 
synthetic sequences, i.e., they do not occur in nature, and preferably encode the identical 
polypeptide (or an enzymatically active fragment of a full length polypeptide which has 
substantially the same activity as the full length polypeptide) encoded by the non-codon 
optimized parent polynucleotide which encodes a processing enzyme. It is preferred that the 
polypeptide is biochemically distinct or improved, e.g., via recursive mutagenesis of DNA 
encoding a particular processing enzyme, from the parent source polypeptide such that its 
performance in the process application is improved. Preferred polynucleotides are optimized for 
expression in a target host plant and encode a processing enzyme. Methods to prepare these 
enzymes include mutagenesis, e.g., recursive mutagenesis and selection. Methods for mutagenesis 
and nucleotide sequence alterations are well-known in the art. See, for example, Kunkel, Proc. 
Natl. Acad. Sci. USA , 82:488, (1985); Kunkel et al., Methods in EnzvmoL 154:367 (1987); US 
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Patent No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology 
(MacMillan Publishing Company, New York) and the references cited therein and Arnold et al., 
Chem. Eng. Sci. , 51:5091 (1996)). Methods to optimize the expression of a nucleic acid segment 
in a target plant or organism are well-known in the art. Briefly, a codon usage table indicating the 
optimal codons used by the target organism is obtained and optimal codons are selected to replace 
those in the target polynucleotide and the optimized sequence is then chemically synthesized. 
Preferred codons for maize are described in U.S. Patent No. 5,625,136. 

Complementary nucleic acids of the polynucleotides of the present invention are further 
envisioned. An example of low stringency conditions for hybridization of complementary 
nucleic acids which have more than 100 complementary residues on a filter in a Southern or 
Northern blot is 50% formamide, e.g., hybridization in 50% formamide, 1 M NaCl, 1% SDS at 
37°C, and a wash in 0.1 X SSC at 60°C to 65°C. Exemplary low stringency conditions include 
hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium 
dodecyl sulphate) at 37°C, and a wash in IX to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M 
trisodium citrate) at 50 to 55°C. Exemplary moderate stringency conditions include 
hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 37°C, and a wash in 0.5X to IX 
SSC at 55 to 60°C. 

Moreover, polynucleotides encoding an "enzymatically active" fragment of the 
processing enzymes are further envisioned. As used herein, "enzymatically active" means a 
polypeptide fragment of the processing enzyme that has substantially the same biological activity 
as the processing enzyme to modify the substrate upon which the processing enzyme normally 
acts under appropriate conditions. 

In a preferred embodiment, the polynucleotide of the present invention is a maize- 
optimized polynucleotide encoding a-amylase, such as provided in SEQ ID NOs:2, 9, 46, and 52. 
In another preferred embodiment, the polynucleotide is a maize-optimized polynucleotide 
encoding pullulanase, such as provided in SEQ ID NOs: 4 and 25. In yet another preferred 
embodiment, the polynucleotide is a maize-optimized polynucleotide encoding a-glucosidase as 
provided in SEQ ED NO:6. Another preferred polynucleotide is the maize-optimized 
polynucleotide encoding glucose ispmerase having SEQ ED NO: 19, 21, 37, 39, 41, or 43. In 
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another embodiment, the maize-optimized polynucleotide encoding glucoamylase as set forth in 
SEQ ID NO: 46, 48, or 50 is preferred. Moreover, a maize-optimized polynucleotide for 
glucanase/mannanase fusion polypeptide is provided in SEQ ID NO: 57. The invention further 
provides for complements of such polynucleotides, which hybridize under moderate, or 
preferably under low stringency, hybridization conditions and which encodes a polypeptide 
having oc-amylase, pullulanase, a-glucosidase, glucose isomerase, glucoamylase, glucanase, or 
mannanase activity, as the case may be. 

The polynucleotide may be used interchangeably with "nucleic acid" or "polynucleic 
acid" and refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- 
or double-stranded form, composed of monomers (nucleotides) containing a sugar, phosphate 
and a base, which is either a purine or pyrimidine. Unless specifically limited, the term 
encompasses nucleic acids containing known analogs of natural nucleotides, which have similar 
binding properties as the reference nucleic acid and are metabolized in a manner similar to 
naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence 
also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon 
substitutions) and complementary sequences as well as the sequence explicitly indicated. 
Specifically, degenerate codon substitutions may be achieved by generating sequences in which 
the third position of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues. 

"Variants" or substantially similar sequences are further encompassed herein. For 
nucleotide sequences, variants include those sequences that, because of the degeneracy of the 
genetic code, encode the identical amino acid sequence of the native protein. Naturally 
occurring allelic variants such as these can be identified with the use of well-known molecular 
biology techniques, as, for example, with polymerase chain reaction (PCR), hybridization 
techniques, and ligation reassembly techniques. Variant nucleotide sequences also include 
synthetically derived nucleotide sequences, such as those generated, for example, by using site- 
directed mutagenesis, which encode the native protein, as well as those that encode a polypeptide 
having amino acid substitutions. Generally, nucleotide sequence variants of the invention will 
have at least 40%, 50%, 60%, preferably 70%, more preferably 80%, even more preferably 90%, 
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most preferably 99%, and single unit percentage identity to the native nucleotide sequence based 
on these classes. For example, 71%, 72%, 73% and the like, up to at least the 90% class. Variants 
may also include a full-length gene corresponding to an identified gene fragment. 

Regulatory Sequences: Promoters/Signal Sequences/Selectable Markers 

The polynucleotide sequences encoding the processing enzyme of the present invention 
may be operably linked to polynucleotide sequences encoding localization signals or signal 
sequence (at the N- or C-terminus of a polypeptide), e.g., to target the hyperthermophilic enzyme 
to a particular compartment within a plant. Examples of such targets include, but are not limited 
to, the vacuole, endoplasmic reticulum, chloroplast, amyloplast, starch granule, or cell wall, or to 
a particular tissue, e.g., seed. The expression of a polynucleotide encoding a processing enzyme 
having a signal sequence in a plant, in particular, in conjunction with the use of a tissue-specific 
or inducible promoter, can yield high levels of localized processing enzyme in the plant. 
Numerous signal sequences are known to influence the expression or targeting of a 
polynucleotide to a particular compartment or outside a particular compartment. Suitable signal 
sequences and targeting promoters are known in the art and include, but are not limited to, those 
provided herein. 

For example, where expression in specific tissues or organs is desired, tissue-specific 
promoters may be used. In contrast, where gene expression in response to a stimulus is desired, 
inducible promoters are the regulatory elements of choice. Where continuous expression is 
desired throughout the cells of a plant, constitutive promoters are utilized. Additional regulatory 
sequences upstream and/or downstream from the core promoter sequence may be included in 
expression constructs of transformation vectors to bring about varying levels of expression of 
heterologous nucleotide sequences in a transgenic plant. 

A number of plant promoters have been described with various expression characteristics. 
Examples of some constitutive promoters which have been described include the rice actin 1 
(Wang et al., Mol. Cell. Biol. , 12:3399 (1992); U.S. Patent No. 5,641,876), CaMV 35S (Odell et 
al., Nature , 313:810 (1985)), CaMV 19S (Lawton et al., 1987), nos (Ebert et al., 1987), Adh 
(Walker et al., 1987), sucrose synthase (Yang & Russell, 1990), and the ubiquitin promoters. 
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Vectors for use in tissue-specific targeting of genes in transgenic plants will typically 
include tissue-specific promoters and may also include other tissue-specific control elements 
such as enhancer sequences. Promoters which direct specific or enhanced expression in certain 
plant tissues will be known to those of skill in the art in light of the present disclosure. These 
include, for example, the rbcS promoter, specific for green tissue; the ocs, nos and mas 
promoters which have higher activity in roots or wounded leaf tissue; a truncated (-90 to +8) 35S 
promoter which directs enhanced expression in roots, an a-tubulin gene that directs expression in 
roots and promoters derived from zein storage protein genes which direct expression in 
endosperm. 

Tissue specific expression may be functionally accomplished by introducing a 
constitutively expressed gene (all tissues) in combination with an antisense gene that is 
expressed only in those tissues where the gene product is not desired. For example, a gene 
coding for a lipase may be introduced such that it is expressed in all tissues using the 35S 
promoter from Cauliflower Mosaic Virus. Expression of an antisense transcript of the lipase 
gene in a maize kernel, using for example a zein promoter, would prevent accumulation of the 
lipase protein in seed. Hence the protein encoded by the introduced gene would be present in all 

tissues except the kernel. 

Moreover, several tissue-specific regulated genes and/or promoters have been reported in 
plants. Some reported tissue-specific genes include the genes encoding the seed storage proteins 
(such as napin, cruciferin, beta-conglycinin, and phaseolin) zein or oil body proteins (such as 
oleosin), or genes involved in fatty acid biosynthesis (including acyl carrier protein, stearoyl- 
ACP desaturase, and fatty acid desaturases (fad 2-1)), and other genes expressed during embryo 
development (such as Bce4, see, for example, EP 255378 and Kridl et al., Seed Science 
Research , 1:209 (1991)). Examples of tissue-specific promoters, which have been described 
include the lectin (Vodkin, Prog. Clin. Biol. Res. , 138:87 (1983); Lindstrom et al., Per. Genet., 
JT:160 (1990)), corn alcohol dehydrogenase 1 (Vogel et al., 1989; Dennis et al., Nucleic Acids 
Res. , 12:3983 (1984)), corn light harvesting complex (Simpson, 1986; Bansal et al., Proc. Natl. 
Acad. Sci. USA , 89:3654 (1992)), corn heat shock protein (Odell et al., 1985; Rochester et al., 
1986), pea small subunit RuBP carboxylase (Poulsen et al., 1986; Cashmore et al., 1983), Ti 
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plasmid mannopine synthase (Langridge et al., 1989), Ti plasmid nopaline synthase (Langridge 
et al., 1989), petunia chalcone isomerase (vanTunen et al., EMBO J. , 7; 1257(1 988)), bean 
glycine rich protein 1 (Keller et al., Genes Dev. , 3: 1639 (1989)), truncated CaMV 35s (Odell et 
al., Nature , 313:810 (1985)), potato patatin (Wenzler et al., Plant Mol. Biol. , 13:347 (1989)), root 
cell (Yamamoto et al., Nucleic Acids Res. , 18:7449 (1990)), maize zein (Reina et al.. Nucleic 
Acids Res. , 18:6425 (1990); Kriz et al., Mol. Gen. Genet. , 207:90 (1987); Wandelt et al., Nucleic 
Acids Res. , 17:2354 (1989); Langridge et al., Cell , 34:1015 (1983); Reina et al., Nucleic Acids 
Res. , 18:7449 (1990)), globulin- 1 (Belanger et al., Genetics , 129:863 (1991)), ot-tubulin, cab 
(Sullivan et al., Mol. Gen. Genet. , 215:431 (1989)), PEPCase (Hudspeth & Grula, 1989), R gene 
complex-associated promoters (Chandler et al., Plant Cell , 1:1 175 (1989)), and chalcone 
synthase promoters (Franken et al., EMBO J. , 10:2605 (1991)). Particularly useful for seed- 
specific expression is the pea vicilin promoter (Czako et al., Mol. Gen. Genet. , 235:33 (1992). 
(See also U.S. Pat. No. 5,625,136, herein incorporated by reference.) Other useful promoters for 
expression in mature leaves are those that are switched on at the onset of senescence, such as the 
SAG promoter from Arabidopsis (Gan et al., Science , 270 :1986 (1995). 

A class of fruit-specific promoters expressed at or during anthesis through fruit 
development, at least until the beginning of ripening, is discussed in U.S. 4,943,674, the 
disclosure of which is hereby incorporated by reference. cDNA clones that are preferentially 
expressed in cotton fiber have been isolated (John et al., Proc. Natl. Acad. Sci. USA , 89:5769 
(1992). cDNA clones from tomato displaying differential expression during fruit development 
have been isolated and characterized (Mansson et al., Gen. Genet., 200 :356 (1985), Slater et al., 
Plant Mol. Biol. , 5:137 (1985)). The promoter for polygalacturonase gene is active in fruit 
ripening. The polygalacturonase gene is described in U.S. Patent No. 4,535,060, U.S. Patent No. 
4,769,061, U.S. Patent No. 4,801,590, and U.S. Patent No. 5,107,065, which disclosures are 
incorporated herein by reference. 

Other examples of tissue-specific promoters include those that direct expression in leaf 
cells following damage to the leaf (for example, from chewing insects), in tubers (for example, 
patatin gene promoter), and in fiber cells (an example of a developmentally-regulated fiber cell 
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protein is E6 (John et al., Proc. NatL Acad. Sci. USA , 89:5769 (1992). The E6 gene is most 
active in fiber, although low levels of transcripts are found in leaf, ovule and flower. 

The tissue-specificity of some "tissue-specific" promoters may not be absolute and may 
be tested by one skilled in the art using the diphtheria toxin sequence. One can also achieve 
tissue-specific expression with "leaky" expression by a combination of different tissue-specific 
promoters (Beals et al., Plant Cell 9: 1527 (1997)). Other tissue-specific promoters can be 
isolated by one skilled in the art (see U.S. 5,589,379). 

In one embodiment, the direction of the product from a polysaccharide hydrolysis gene, 
such as a-amylase, may be targeted to a particular organelle such as the apoplast rather than to 
the cytoplasm. This is exemplified by the use of the maize y-zein N-terminal signal sequence 
(SEQ ID NO: 17), which confers apoplast-specific targeting of proteins. Directing the protein or 
enzyme to a specific compartment will allow the enzyme to be localized in a manner that it will 
not come into contact with the substrate. In this manner the enzymatic action of the enzyme will 
not occur until the enzyme contacts its substrate. The enzyme can be contacted with its substrate 
by the process of milling (physical disruption of the cell integrity), or heating the cells or plant 
tissues to disrupt the physical integrity of the plant cells or organs that contain the enzyme. For 
example a mesophilic starch-hydrolyzing enzyme can be targeted to the apoplast or to the 
endoplasmic reticulum and so as not to come into contact with starch granules in the amyloplast. 
Milling of the grain will disrupt the integrity of the grain and the starch hydrolyzing enzyme will 
then contact the starch granules. In this manner the potential negative effects of co-localization 
of an enzyme and its substrate can be circumvented. 

In another embodiment, a tissue-specific promoter includes the endosperm-specific 
promoters such as the maize Y-zein promoter (exemplified by SEQ ID NO: 12) or the maize 
ADP-gpp promoter (exemplified by SEQ ED NO: 11, which includes a 5' untranslated and an 
intron sequence) or a Q protein promoter (exemplified by SEQ ID NO: 98) or a rice glutelin 1 
promoter (exemplified in SEQ ID NO:67). Thus, the present invention includes an isolated 
polynucleotide comprising a promoter comprising SEQ ID NO: 11, 12, 67, or 98, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 
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conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ED NO:l 1, 12, 67, or 98. 

In another embodiment of the invention, the polynucleotide encodes a hyperthermophilic 
processing enzyme that is operably linked to a chloroplast (amyloplast) transit peptide (CTP) and 
a starch binding domain, e.g., from the waxy gene. An exemplary polynucleotide in this 
embodiment encodes SEQ ID NO: 10 (a-amylase linked to the starch binding domain from 
waxy). Other exemplary polynucleotides encode a hyperthermophilic processing enzyme linked 
to a signal sequence that targets the enzyme to the endoplasmic reticulum and secretion to the 
apoplast (exemplified by a polynucleotide encoding SEQ ED NO: 13, 27, or 30, which comprises 
the N-terminal sequence from maize y-zein operably linked to a-amylase, a-glucosidase, glucose 
isomerase, respectively), a hyperthermophilic processing enzyme linked to a signal sequence 
which retains the enzyme in the endoplasmic reticulum (exemplified by a polynucleotide 
encoding SEQ ED NO: 14, 26, 28, 29, 33, 34, 35, or 36, which comprises the N-terminal sequence 
from maize y-zein operably linked to the hyperthermophilic enzyme, which is operably linked to 
SEKDEL, wherein the enzyme is a-amylase, malA a-glucosidase, T. maritima glucose 
isomerase, T. neapolitana glucose isomerase), a hyperthermophilic processing enzyme linked to 
an N-terminal sequence that targets the enzyme to the amyloplast (exemplified by a 
polynucleotide encoding SEQ ID NO: 15, which comprises the N-terminal amyloplast targeting 
sequence from waxy operably linked to a-amylase), a hyperthermophilic fusion polypeptide 
which targets the enzyme to starch granules (exemplified by a polynucleotide encoding SEQ ID 
NO: 16, which comprises the N-terminal amyloplast targeting sequence from waxy operably 
linked to an a-amylase/waxy fusion polypeptide comprising the waxy starch binding domain), a 
hyperthermophilic processing enzyme linked to an ER retention signal (exemplified by a 
polynucleotide encoding SEQ ID NO:38 and 39). Moreover, a hyperthermophilic processing 
enzyme may be linked to a raw-starch binding site having the amino acid sequence (SEQ ID 
NO:53), wherein the polynucleotide encoding the processing enzyme is linked to the maize- 
optimized nuleic acid sequence (SEQ ED NO:54) encoding this binding site. 

Several inducible promoters have been reported. Many are described in a review by 
Gatz, in Current Opinion in Biotechnology , 7:168 (1996) and Gatz, C, Annu. Rev. Plant Physiol. 
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Plant Mol. Biol. , 48:89 (1997). Examples include tetracycline repressor system, Lac repressor 
system, copper-inducible systems, salicylate-inducible systems (such as the PRla system), 
glucocorticoid-inducible (Aoyama T. et al., N-H Plant Journal 11:605 (1997)) and ecdysone- 
inducible systems. Other inducible promoters include ABA- and turgor-inducible promoters, the 
promoter of the auxin-binding protein gene (Schwob et al., Plant J. , 4:423 (1993)), the UDP 
glucose flavonoid glycosyl-transferase gene promoter (Ralston et al., Genetics , 1 19 :185 (1988)), 
the MPI proteinase inhibitor promoter (Cordero et al., Plant J. , 6:141 (1994)), and the 
glyceraldehyde-3-phosphate dehydrogenase gene promoter (Kohler et al., Plant Mol. Biol. , 
29;1293 (1995); Quigley et al., J. Mol. Evol. , 29:412 (1989); Martinez et al., J. Mol. Biol. , 
208-551 (1989)). Also included are the benzene sulphonamide-inducible (U.S. 5364,780) and 
alcohol-inducible (WO 97/06269 and WO 97/06268) systems and glutathione S-transferase 
promoters. 

Other studies have focused on genes inducibly regulated in response to environmental 
stress or stimuli such as increased salinity, drought, pathogen and wounding. (Graham et al., J. 
Biol. Chem. , 260:6555 (1985); Graham et al., J. Biol. Chem. . 260:6561 (1985), Smith et al., 
Planta , 168:94 (1986)). Accumulation of metallocarboxypeptidase-inhibitor protein has been 
reported in leaves of wounded potato plants (Graham et ah, Biochem. Biophvs. Res. Comm. , 
101 : 1 1 64 (1981)). Other plant genes have been reported to be induced by methyl jasmonate, 
elicitors, heat-shock, anaerobic stress, or herbicide safeners. 

Regulated expression of a chimeric transacting viral replication protein can be further 

regulated by other genetic strategies, such as, for example, Cre-mediated gene activation (Odell 

et al. Mol. Gen. Genet. , U3:369 (1990)). Thus, a DNA fragment containing 3' regulatory 

sequence bound by lox sites between the promoter and the replication protein coding sequence 

that blocks the expression of a chimeric replication gene from the promoter can be removed by 

Cre-mediated excision and result in the expression of the trans-acting replication gene. In this 

case, the chimeric Cre gene, the chimeric trans-acting replication gene, or both can be under the 

control of tissue- and developmental-specific or inducible promoters. An alternate genetic 

strategy is the use of tRNA suppressor gene. For example, the regulated expression of a tRNA 

suppressor gene can conditionally control expression of a trans-acting replication protein coding 

sequence containing an appropriate termination codon (Ulmasov et al. Plant Mol. Biol. , 35:417 
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(1997)). Again, either the chimeric tRNA suppressor gene, the chimeric transacting replication 
gene, or both can be under the control of tissue- and developmental-specific or inducible 
promoters. 

Preferably, in the case of a multicellular organism, the promoter can also be specific to a 
particular tissue, organ or stage of development. Examples of such promoters include, but are 
not limited to, the Zea mays ADP-gpp and the Zea mays y-zein promoter and the Zea mays 
globulin promoter . 

Expression of a gene in a transgenic plant may be desired only in a certain time period 
during the development of the plant. Developmental timing is frequently correlated with tissue 
specific gene expression. For example, expression of zein storage proteins is initiated in the 
endosperm about 1 5 days after pollination. 

Additionally, vectors may be constructed and employed in the intracellular targeting of a 
specific gene product within the cells of a transgenic plant or in directing a protein to the 
extracellular environment. This will generally be achieved by joining a DNA sequence encoding 
a transit or signal peptide sequence to the coding sequence of a particular gene. The resultant 
transit, or signal, peptide will transport the protein to a particular intracellular, or extracellular 
destination, respectively, and will then be post-translationally removed. Transit or signal 
peptides act by facilitating the transport of proteins through intracellular membranes, e.g., 
vacuole, vesicle, plastid and mitochondrial membranes, whereas signal peptides direct proteins 
through the extracellular membrane. 

A signal sequence such as the maize y-zein N-terminal signal sequence for targeting to 
the endoplasmic reticulum and secretion into the apoplast may be operably linked to a 
polynucleotide encoding a hyperthermophilic processing enzyme in accordance with the present 
invention (Torrent et al., 1997). For example, SEQ ID NOs:13, 27, and 30 provides for a 
polynucleotide encoding a hyperthermophilic enzyme operably linked to the N-terminal 
sequence from maize y-zein protein. Another signal sequence is the amino acid sequence 
SEKDEL for retaining polypeptides in the endoplasmic reticulum (Munro and Pelham, 1987). 
For example, a polynucleotide encoding SEQ ID NOS:14, 26, 28, 29, 33, 34, 35, or 36, which 
comprises the N-terminal sequence from maize y-zein operably linked to a processing enzyme 
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which is operably linked to SEKDEL. A polypeptide may also be targeted to the amyloplast by 
fusion to the waxy amyloplast targeting peptide (Klosgen et ah, 1986) or to a starch granule. For 
example, the polynucleotide encoding a hyperthermophilic processing enzyme may be operably 
linked to a chloroplast (amyloplast) transit peptide (CTP) and a starch binding domain, e.g., from 
the waxy gene. SEQ ID NO: 10 exemplifies a-amylase linked to the starch binding domain from 
waxy. SEQ ID NO: 15 exemplifies the N-terminal sequence amyloplast targeting sequence from 
waxy operably linked to a-amylase. Moreover, the polynucleotide encoding the processing 
enzyme may be fused to target starch granules using the waxy starch binding domain. For 
example, SEQ ED NO: 16 exemplifies a fusion polypeptide comprising the N-terminal amyloplast 
targeting sequence from waxy operably linked to an ct-amylase/waxy fusion polypeptide 
comprising the waxy starch binding domain. 

The polynucleotides of the present invention, in addition to processing signals, may 
further include other regulatory sequences, as is known in the art. "Regulatory sequences" and 
"suitable regulatory sequences" each refer to nucleotide sequences located upstream (5' non- 
coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and 
which influence the transcription, RNA processing or stability, or translation of the associated 
coding sequence. Regulatory sequences include enhancers, promoters, translation leader 
sequences, introns, and polyadenylation signal sequences. They include natural and synthetic 
sequences as well as sequences, which may be a combination of synthetic and natural sequences. 

Selectable markers may also be used in the present invention to allow for the selection of 
transformed plants and plant tissue, as is well-known in the art. One may desire to employ a 
selectable or screenable marker gene as, or in addition to, the expressible gene of interest. 
"Marker genes" are genes that impart a distinct phenotype to cells expressing the marker gene 
and thus allow such transformed cells to be distinguished from cells that do not have the marker. 
Such genes may encode either a selectable or screenable marker, depending on whether the 
marker confers a trait which one can select for by chemical means, i.e., through the use of a 
selective agent (e.g., a herbicide, antibiotic, or the like), or whether it is simply a trait that one 
can identify through observation or testing, i.e., by screening (e.g., the R-locus trait). Of course, 
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many examples of suitable marker genes are known to the art and can be employed in the 

practice of the invention. 

Included within the terms selectable or screenable marker genes are also genes which 
encode a "secretable marker" whose secretion can be detected as a means of identifying or 
selecting for transformed cells. Examples include markers which encode a secretable antigen that 
can be identified by antibody interaction, or even secretable enzymes which can be detected by 
their catalytic activity. Secretable proteins fall into a number of classes, including small, 
diffusible proteins detectable, e.g., by ELISA; small active enzymes detectable in extracellular 
solution (e.g., a-amylase, P-lactamase, phosphinothricin acetyltransferase); and proteins that are 
inserted or trapped in the cell wall (e.g., proteins that include a leader sequence such as that 
found in the expression unit of extensin or tobacco PR-S). 

With regard to selectable secretable markers, the use of a gene that encodes a protein that 
becomes sequestered in the cell wall, and which protein includes a unique epitope is considered 
to be particularly advantageous. Such a secreted antigen marker would ideally employ an 
epitope sequence that would provide low background in plant tissue, a promoter-leader sequence 
that would impart efficient expression and targeting across the plasma membrane, and would 
produce protein that is bound in the cell wall and yet accessible to antibodies. A normally 
secreted wall protein modified to include a unique epitope would satisfy all such requirements. 

One example of a protein suitable for modification in this manner is extensin, or 
hydroxyproline rich glycoprotein (HPRG). For example, the maize HPRG (Steifel et al., The 
Plant CelU 2:785 (1990)) molecule is well characterized in terms of molecular biology, 
expression and protein structure. However, any one of a variety of extensins and/or glycine-rich 
wall proteins (Keller et al., EMBO Journal , 8:1309 (1989)) could be modified by the addition of 
an antigenic site to create a screenable marker. 

a. Selectable Markers 

Possible selectable markers for use in connection with the present invention include, but 
are not limited to, a neo or nptll gene (Potrykus et al., Mol. Gen. Genet ., 199: 183 (1 985)) which 
codes for kanamycin resistance and can be selected for using kanamycin, G418, and the like; a 
bar gene which confers resistance to the herbicide phosphinothricin; a gene which encodes an 
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altered EPSP synthase protein (Hinchee et al., Biotech ., 6:915 (1988)) thus conferring glyphosate 
resistance; a nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to 
bromoxynil (Stalker et al., Science , 242:419 (1988)); a mutant acetolactate synthase gene (ALS) 
which confers resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals 
(European Patent Application 154,204, 1985); a methotrexate-resistant DHFR gene (Thillet et 
al., J. Biol. Chem ., 263:12500 (1988)); a dalapon dehalogenase gene that confers resistance to 
the herbicide dalapon; a phosphomannose isomerase (PMI) gene; a mutated anthranilate synthase 
gene that confers resistance to 5-methyl tryptophan; the hph gene which confers resistance to the 
antibiotic hygromycin; or the mannose-6-phosphate isomerase gene (also referred to herein as 
the phosphomannose isomerase gene), which provides the ability to metabolize mannose (U.S. 
Patent Nos. 5,767,378 and 5,994,629). One skilled in the art is capable of selecting a suitable 
selectable marker gene for use in the present invention. Where a mutant EPSP synthase gene is 
employed, additional benefit may be realized through the incorporation of a suitable chloroplast 
transit peptide, CTP (European Patent Application 0,218,571, 1987). 

An illustrative embodiment of a selectable marker gene capable of being used in systems 
to select transformants are the genes that encode the enzyme phosphinothricin acetyltransferase, 
such as the bar gene from Streptomyces hygroscopicus or the pat gene from Streptomyces 
viridochromogenes . The enzyme phosphinothricin acetyl transferase (PAT) inactivates the 
active ingredient in the herbicide bialaphos, phosphinothricin (PPT). PPT inhibits glutamine 
synthetase, (Murakami et al., Mol. Gen. Genet ., 205:42 (1986); Twell et al., Plant Physiol.. 
91:1270 (1989)) causing rapid accumulation of ammonia and cell death. The success in using 
this selective system in conjunction with monocots was particularly surprising because of the 
major difficulties which have been reported in transformation of cereals (Potrykus, Trends 
Biotech., 7:269(1989)). 

Where one desires to employ a bialaphos resistance gene in the practice of the invention, 

a particularly useful gene for this purpose is the bar or pat genes obtainable from species of 

Streptomyces (e.g., ATCC No. 21,705). The cloning of the bar gene has been described 

(Murakami et al., Mol. Gen. Genet ., 205:42 (1986); Thompson et al., EMBO Journal , 6:2519 

(1987)) as has the use of the bar gene in the context of plants other than monocots (De Block et 

al., EMBO Journal , 6:2513 (1987); De Block et al., Plant Physiol ., 91:694 (1989)). 
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h. Screenable Markers 

Screenable markers that may be employed include, but are not limited to, a p- 
glucuronidase or uidA gene (GUS) which encodes an enzyme for which various chromogenic 
substrates are known; an R-locus gene, which encodes a product that regulates the production of 
anthocyanin pigments (red color) in plant tissues (Dellaporta et al., in Chromosome Structure 
and Function , pp. 263-282 (1988)); a p-lactamase gene (Sutcliffe, PNAS USA, 75:3737 (1978)), 
which encodes an enzyme for which various chromogenic substrates are known (e.g., PADAC, a 
chromogenic cephalosporin); a xylE gene (Zukowsky et al., PNAS USA, 80:1 101 (1983)) which 
encodes a catechol dioxygenase that can convert chromogenic catechols; an a-amylase gene 
(Dcuta et al., Biotech. , 8:241 (1990)); a tyrosinase gene (Katz et al., J. Gen. Microbiol., 129:2703 
(1983)) which encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone 
which in turn condenses to form the easily detectable compound melanin; a p-galactosidase 
gene, which encodes an enzyme for which there are chromogenic substrates; a luciferase (lux) 
gene (Ow et al, Science , 234:856 (1986)), which allows for bioluminescence detection; or an 
aequorin gene (Prasher et al., Biochem. Biophvs. Res. Comm. , 126:1259 (1985)), which may be 
employed in calcium-sensitive bioluminescence detection, or a green fluorescent protein gene 
(Niedz et al., Plant Cell Reports , 14: 403 (1995)). 

Genes from the maize R gene complex are contemplated to be particularly useful as 
screenable markers. The R gene complex in maize encodes a protein that acts to regulate the 
production of anthocyanin pigments in most seed and plant tissue. A gene from the R gene 
complex is suitable for maize transformation, because the expression of this gene in transformed 
cells does not harm the cells. Thus, an R gene introduced into such cells will cause the 
expression of a red pigment and, if stably incorporated, can be visually scored as a red sector. If 
a maize line carries dominant allelles for genes encoding the enzymatic intermediates in the 
anthocyanin biosynthetic pathway (C2, Al, A2, Bzl and Bz2), but carries a recessive allele at 
the R locus, transformation of any cell from that line with R will result in red pigment formation. 
Exemplary lines include Wisconsin 22 which contains the rg-Stadler allele and TR1 12, a K55 
derivative which is r-g, b, PI. Alternatively any genotype of maize can be utilized if the CI and 
R alleles are introduced together. A further screenable marker contemplated for use in the 

40 



WO 2005/096804 



PCT/US2004/007182 



present invention is firefly luciferase, encoded by the lux gene. The presence of the lux gene in 
transformed cells may be detected using, for example, X-ray film, scintillation counting, 
fluorescent spectrophotometry, low-light video cameras, photon counting cameras or multiwell 
luminometry. It is also envisioned that this system may be developed for populational screening 
for bioluminescence, such as on tissue culture plates, or even for whole plant screening. 

The polynucleotides used to transform the plant may include, but is not limited to, DNA 
from plant genes and non-plant genes such as those from bacteria, yeasts, animals or viruses. 
The introduced DNA can include modified genes, portions of genes, or chimeric genes, including 
genes from the same or different maize genotype. The term "chimeric gene" or "chimeric DNA" 
is defined as a gene or DNA sequence or segment comprising at least two DNA sequences or 
segments from species which do not combine DNA under natural conditions, or which DNA 
sequences or segments are positioned or linked in a manner which does not normally occur in the 
native genome of the untransformed plant. 

Expression cassettes comprising the polynucleotide encoding a hyperthermophilic 
processing enzyme, and preferably a codon-optimized polynucleotide is further provided. It is 
preferred that the polynucleotide in the expression cassette (the first polynucleotide) is operably 
linked to regulatory sequences, such as a promoter, an enhancer, an intron, a termination 
sequence, or any combination thereof, and, optionally, to a second polynucleotide encoding a 
signal sequence (N- or C-terminal) which directs the enzyme encoded by the first polynucleotide 
to a particular cellular or subcellular location. Thus, a promoter and one or more signal 
sequences can provide for high levels of expression of the enzyme in particular locations in a 
plant, plant tissue or plant cell. Promoters can be constitutive promoters, inducible (conditional) 
promoters or tissue-specific promoters, e.g., endosperm-specific promoters such as the maize 7- 
zein promoter (exemplified by SEQ ID NO: 12) or the maize ADP-gpp promoter (exemplified by 
SEQ ED NO:l 1, which includes a 5' untranslated and an intron sequence). The invention also 
provides an isolated polynucleotide comprising a promoter comprising SEQ ED NO:l 1 or 12, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 
conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ED NO:l 1 or 12. Also provided are vectors 
which comprise the expression cassette or polynucleotide of the invention and transformed cells 
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comprising the polynucleotide, expression cassette or vector of the invention. A vector of the 
invention can comprise a polynucleotide sequence which encodes more than one 
hyperthermophilic processing enzyme of the invention, which sequence can be in sense or 
antisense orientation, and a transformed cell may comprise one or more vectors of the invention. 
Preferred vectors are those useful to introduce nucleic acids into plant cells. 

Transformation 

The expression cassette, or a vector construct containing the expression cassette may be 
inserted into a cell. The expression cassette or vector construct may be carried episomally or 
integrated into the genome of the cell. The transformed cell may then be grown into a transgenic 
plant. Accordingly, the invention provides the products of the transgenic plant. Such products 
may include, but are not limited to, the seeds, fruit, progeny, and products of the progeny of the 
transgenic plant. 

A variety of techniques are available and known to those skilled in the art for 
introduction of constructs into a cellular host. Transformation of bacteria and many eukaryotic 
cells may be accomplished through use of polyethylene glycol, calcium chloride, viral infection, 
phage infection, electroporation and other methods known in the art. Techniques for 
transforming plant cells or tissue include transformation with DNA employing A. twnefaciens or 
A. rhizogenes as the transforming agent, electroporation, DNA injection, microprojectile 
bombardment, particle acceleration, etc. (See, for example, EP 295959 and EP 138341). 

In one embodiment, binary type vectors of Ti and Ri plasmids of Agrobacterium spp. Ti- 
derived vectors are used to transform a wide variety of higher plants, including 
monocotyledonous and dicotyledonous plants, such as soybean, cotton, rape, tobacco, and rice 
(Pacciotti et al. Bio/Technology, 3:241 (1985): Byrne et al. Plant Cell Tissue and Organ Culture , 
8:3 (1987); Sukhapinda et al. Plant Mol. Biol. , 8:209 (1987); Lorz et al. Mol. Gen. Genet. , 
199:178 (1985); Potrykus Mol. Gen. Genet. , 199:183 (1985); Park et al., J. Plant Biol. , 38:365 
(1985): Hiei et al., Plant J. , 6:271(1994)). The use of T-DNA to transform plant cells has 
received extensive study and is amply described (EP 120516; Hoekema, In: The Binary Plant 
Vector System . Offset-drukkerij Kanters B.V.; Alblasserdam (1985), Chapter V; Knauf, et al., 
Genetic Analysis of Host Range Expression by Agrobacterium In: Molecular Genetics of the 
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Bacteria-Plant Interaction , Puhler, A. ed., Springer- Verlag, New York, 1983, p. 245; and An. et 
al., EMBOL, 4:277 (1985)). 

Other transformation methods are available to those skilled in the art, such as direct 
uptake of foreign DNA constructs (see EP 295959), techniques of electroporation (Fromm et al. 
Nature (London) , 319 :791 (1986), or high velocity ballistic bombardment with metal particles 
coated with the nucleic acid constructs (Kline et al. Nature (London) 327 :70 (1987), and U.S. 
Patent No. 4,945,050). Once transformed, the cells can be regenerated by those skilled in the art. 
Of particular relevance are the recently described methods to transform foreign genes into 
commercially important crops, such as rapeseed (De Block et al., Plant Physiol. 91 :694-701 
(1989)), sunflower (Everett et al., Bio/Technology , 5:1201(1987)), soybean (McCabe et al., 
Bio/Technology , 6:923 (1988); Hinchee et al., Bio/Technology , 6:915 (1988); Chee et al., Plant 
Physiol. , 91:1212 (1989); Christou et al., Proc. Natl. Acad. Sci USA , 86:7500 (1989) EP 
301749), rice (Hiei et al., Plant J. , 6:271 (1994)), and corn (Gordon Kamm et al., Plant Cell , 
2:603 (1990); Fromm et al., Biotechnology , 8:833, (1990)). 

Expression vectors containing genomic or synthetic fragments can be introduced into 
protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced 
into intact tissue. General methods of culturing plant tissues are provided, for example, by Maki 
et al. "Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular 
Biology & Biotechnology, Glich et al. (Eds.), pp. 67-88 CRC Press (1993); and by Phillips et al. 
"Cell-Tissue Culture and In-Vitro Manipulation" in Corn & Corn Improvement, 3rd Edition 10, 
Sprague et al. (Eds.) pp. 345-387, American Society of Agronomy Inc. (1988). 

In one embodiment, expression vectors may be introduced into maize or other plant 
tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA 
injection, electroporation and the like. Expression vectors are introduced into plant tissues using 
the microprojectile media delivery with the biolistic device. See, for example, Tomes et al. 
"Direct DNA transfer into intact plant cells via microprojectile bombardment" in Gamborg and 
Phillips (Eds.) Plant Cell, Tissue and Organ Culture: Fundamental Methods, Springer Verlag, 
Berlin (1995). Nevertheless, the present invention contemplates the transformation of plants 
with a hyperthermophilic processing enzyme in accord with known transforming methods. Also 
see, Weissinger et al., Annual Rev. Genet. , 22:421 (1988); Sanford et al., Particulate Science and 
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Technology . 5:27 (1987) (onion); Christou et al., Plant Physiol. , 87:671 (1988 )(soybean); 
McCabe et al., Bio/Technology . 6:923 (1988) (soybean); Datta et al., Bio/Technology , 8:736 
(1990) (rice); Klein et al., Proc. Natl. Acad. Sci. USA , 85:4305 (1988 )(maize); Klein et al., 
Bio/Technology , 6:559 (1988)(maize); Klein et al., Plant Physiol. , 91:440 (1988)(maize); Fromm 
et al., Bio/Technology , 8:833 (1990) (maize); and Gordon-Kamm et al., Plant Cell 2, 603 
(1990)(maize); Svab et al., Proc. Natl. Acad. Sci. USA , 87:8526 (1990) (tobacco chloroplast); 
Koziel et al., Biotechnology , 11:194 (1993) (maize); Shimamoto et al., Nature , 338:274 (1989) 
(rice); Christou et al., Biotechnology , 9:957 (1991) (rice); European Patent Application EP 0 332 
581 (orchardgrass and other Pooideae); Vasil et al., Biotechnology , 11:1553 (1993) (wheat); 
Weeks et al., Plant Physiol. , 102:1077 (1993) (wheat). Methods in Molecular Biology, 82. 
Arabidopsis Protocols Ed. Martinez-Zapater and Salinas 1998 Humana Press (Arabidopsis). 

Transformation of plants can be undertaken with a single DNA molecule or multiple 
DNA molecules (i.e., co-transformation), and both these techniques are suitable for use with the 
expression cassettes and constructs of the present invention. Numerous transformation vectors 
are available for plant transformation, and the expression cassettes of this invention can be used 
in conjunction with any such vectors. The selection of vector will depend upon the preferred 
transformation technique and the target species for transformation. 

Ultimately, the most desirable DNA segments for introduction into a monocot genome 
may be homologous genes or gene families which encode a desired trait (e.g., hydrolysis of 
proteins, lipids or polysaccharides) and which are introduced under the control of novel 
promoters or enhancers, etc., or perhaps even homologous or tissue specific (e.g., root-, 
collar/sheath-, whorl-, stalk-, earshank-, kernel- or leaf-specific) promoters or control elements. 
Indeed, it is envisioned that a particular use of the present invention will be the targeting of a 
gene in a constitutive manner or in an inducible manner. 

Examples of Suitable Transformation Vectors 

Numerous transformation vectors available for plant transformation are known to those 
of ordinary skill in the plant transformation arts, and the genes pertinent to this invention can be 
used in conjunction with any such vectors known in the art. The selection of vector will depend 
upon the preferred transformation technique and the target species for transformation. 
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a. Vectors Suitable for Azrobacterium Transformation 

Many vectors are available for transformation using Agrobacteriwn tumefaciens. These 
typically carry at least one T-DNA border sequence and include vectors such as pBEN19 (Bevan, 
Nucl. Acids Res. (1984)). Below, the construction of two typical vectors suitable for 
Agrobacteriwn transformation is described. 

pCIB200 and pCIB2001 

The binary vectors pcLB200 and pCEB2001 are used for the construction of recombinant 
vectors for use with Agrobacterium and are constructed in the following manner. pTJS75kan is 
created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. Bacteriol. , 164: 446 (1985)) 
allowing excision of the tetracycline-resistance gene, followed by insertion of an AccI fragment 
from pUC4K carrying an NPTII (Messing & Vierra, Gene , 19: 259 (1982): Bevan et al., Nature , 
304 : 184 (1983): McBride et al., Plant Molecular Biology , 14: 266 (1990)). Xhol linkers are 
ligated to the EcoRV fragment of PCEB7 which contains the left and right T-DNA borders, a 
plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et al., Gene , 53: 153 
(1987)), and the Xhol-digested fragment are cloned into Sail-digested pTJS75kan to create 
pCIB200 (see also EP 0 332 104, example 19). pCIB200 contains the following unique 
polylinker restriction sites: EcoRI, SstI, Kpnl, Bglll, Xbal, and Sail. pCEB2001 is a derivative of 
pCIB200 created by the insertion into the polylinker of additional restriction sites. Unique 
restriction sites in the polylinker of pCIB2001 are EcoRI, SstI, Kpnl, Bglll, Xbal, Sail, Mlul, 
Bell, Avrll, Apal, Hpal, and Stul. pCEB2001, in addition to containing these unique restriction 
sites also has plant and bacterial kanamycin selection, left and right T-DNA borders for 
Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization 
between E. coli and other hosts, and the OnTand OriV functions also from RK2. The pCBB2001 
polylinker is suitable for the cloning of plant expression cassettes containing their own 
regulatory signals. 
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pCIBlO and Hygromvcin Selection Derivatives thereof : 

The binary vector pCEBlO contains a gene encoding kanamycin resistance for selection in 
plants and T-DNA right and left border sequences and incorporates sequences from the wide 
host-range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its 
construction is described by Rothstein et al. ( Gene , 53 : 153 (1987)). Various derivatives of 
pCIBlO are constructed which incorporate the gene for hygromycin B phosphotransferase 
described by Gritz et al. ( Gene , 25 : 179 (1983)). These derivatives enable selection of transgenic 
plant cells on hygromycin only (pCEB743), or hygromycin and kanamycin (pCEB715, pCEB717). 

b. Vectors Suitable for non-Azrobacterium Transformation 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and consequently vectors 
lacking these sequences can be utilized in addition to vectors such as the ones described above 
which contain T-DNA sequences. Transformation techniques that do not rely on Agrobacterium 
include transformation via particle bombardment, protoplast uptake (e.g., PEG and 
electroporation) and microinjection. The choice of vector depends largely on the preferred 
-selection for the species being transformed. Non-limiting examples of the construction of typical 
vectors suitable for non-Agrobacterium transformation is further described. 

pCIB3064 

pCEB3064 is a pUC-derived vector suitable for direct gene transfer techniques in 
combination with selection by the herbicide basta (or phosphinothricin). The plasmid pCEB246 
comprises the CaMV 35 S promoter in operational fusion to the E. coli GUS gene and the CaMV 
35S transcriptional terminator and is described in the PCT published application WO 93/07278. 
The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites are 
mutated using standard PCR techniques in such a way as to remove the ATGs and generate the 
restriction sites Sspl and PvuII. The new restriction sites are 96 and 37 bp away from the unique 
Sail site and 101 and 42 bp away from the actual start site. The resultant derivative of pCIB246 
is designated pCIB3025. The GUS gene is then excised from pCIB3025 by digestion with Sail 
and SacI, the termini rendered blunt and religated to generate plasmid pCEB3060. The plasmid 
pJIT82 may be obtained from the John Innes Centre, Norwich and the a 400 bp Smal fragment 
containing the bar gene from Streptomyces viridochromogenes is excised and inserted into the 
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Hpal site of pCEB3060 (Thompson et al., EMBO J , 6: 2519 (1987)). This generated pCEB3064, 
which comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a polylinker 
with the unique sites SphI, PstI, Hindlll, and BamHI. This vector is suitable for the cloning of 
plant expression cassettes containing their own regulatory signals. 
pSOG19and pSOG35 : 

The plasmid pSOG35 is a transformation vector that utilizes the E. coli gene 
dihydrofolate reductase (DHFR) as a selectable marker conferring resistance to methotrexate. 
PCR is used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adhl gene (-550 
bp) and 18 bp of the GUS untranslated leader sequence from pSOGlO. A 250-bp fragment 
encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and these two 
PCR fragments are assembled with a SacI-PstI fragment from pB1221 (Clontech) which 
comprises the pUC19 vector backbone and the nopaline synthase terminator. Assembly of these 
fragments generates pSOG19 which contains the 35 S promoter in fusion with the intron 6 
sequence, the GUS leader, the DHFR gene and the nopaline synthase terminator. Replacement 
of the GUS leader in pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus 
(MCMV) generates the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for 
ampicillin resistance and have Hindlll, SphI, PstI and EcoRI sites available for the cloning of 
foreign substances. 

c. Vector Suitable for Chloroplast Transformation 

For expression of a nucleotide sequence of the present invention in plant plastids, plastid 
transformation vector pPH143 (WO 97/3201 1, example 36) is used. The nucleotide sequence is 
inserted into pPH143 thereby replacing the PROTOX coding sequence. This vector is then used 
for plastid transformation and selection of transformants for spectinomycin resistance. 
Alternatively, the nucleotide sequence is inserted in pPH143 so that it replaces the aadH gene. In 
this case, transformants are selected for resistance to PROTOX inhibitors. 

Plant Hosts Subject to Transformation Methods 

Any plant tissue capable of subsequent clonal propagation, whether by organogenesis 
or embryogenesis, may be transformed with a construct of the present invention. The term 
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organogenesis means a process by which shoots and roots are developed sequentially from 
meristematic centers while the term embryogenesis means a process by which shoots and roots 
develop together in a concerted fashion (not sequentially), whether from somatic cells or 
gametes. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include differentiated and undifferentiated tissues or plants, including but not limited to 
leaf disks, roots, stems, shoots, leaves, pollen, seeds, embryos, cotyledons, hypocotyls, 
megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristems, axillary 
buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl 
meristem), tumor tissue, and various forms of cells and culture such as single cells, protoplast, 
embryos, and callus tissue. The plant tissue may be in plants or in organ, tissue or cell culture. 

Plants of the present invention may take a variety of forms. The plants may be chimeras 
of transformed cells and non-transformed cells; the plants may be clonal transformants (e.g., all 
cells transformed to contain the expression cassette); the plants may comprise grafts of 
transformed and untransformed tissues (e.g., a transformed root stock grafted to an 
untransformed scion in citrus species). The transformed plants may be propagated by a variety 
of means, such as by clonal propagation or classical breeding techniques. For example, first 
generation (or Tl) transformed plants may be selfed to give homozygous second generation (or 
T2) transformed plants, and the T2 plants further propagated through classical breeding 
techniques. A dominant selectable marker (such as npt D) can be associated with the expression 

cassette to assist in breeding. 

The present invention may be used for transformation of any plant species, including 
monocots or dicots, including, but not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, 
B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa 
(Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, 
Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum 
miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower 
(Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean 
(Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis 
hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea 
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batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple 
(Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), 
banana (Musa spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava), 
mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium 
occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets 
(Beta vulgaris), sugarcane (Saccharum spp.), oats, barley, vegetables, ornamentals, woody plants 
such as conifers and deciduous trees, squash, pumpkin, hemp, zucchini, apple, pear, quince, 
melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry, blackberry, soybean, 
sorghum, sugarcane, rapeseed, clover, carrot, and Arabidopsis thaliana. 

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), 
green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), 
cauliflower, broccoli, turnip, radish, spinach, asparagus, onion, garlic, pepper, celery, and 
members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), 
and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea 
(Macrophylla hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa 
spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), 
poinsettia (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in 
practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), 
slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), 
and Monterey pine (Pinus radiata), Douglas-fir (Pseudotsuga menziesii); Western hemlock 
(Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such 
as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red 
cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis). Leguminous 
plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden 
beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are 
not limited to, Arachis, e.g., peanuts, Vicia, e.g., crown vetch, hairy vetch, adzuki bean, mung 
bean, and chickpea, Lupinus, e.g., lupine, trifolium, Phaseolus, e.g., common bean and lima 
bean, Pisum, e.g., field bean, Melilotus, e.g., clover, Medicago, e.g., alfalfa, Lotus, e.g., trefoil, 
lens, e.g., lentil, and false indigo. Preferred forage and turf grass for use in the methods of the 
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invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, and 
redtop. 

Preferably, plants of the present invention include crop plants, for example, corn, alfalfa, 
sunflower, Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, barley, 
rice, tomato, potato, squash, melons, legume crops, etc. Other preferred plants include Liliopsida 
and Panicoideae. 

Once a desired DNA sequence has been transformed into a particular plant species, it 
may be propagated in that species or moved into other varieties of the same species, particularly 
including commercial varieties, using traditional breeding techniques. 

Below are descriptions of representative techniques for transforming both dicotyledonous 
and monocotyledonous plants, as well as a representative plastid transformation technique. 

a. Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non- 
Agrobacterium techniques involve the uptake of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques are 
described by Paszkowski et al., EMBOJ, 3: 2717 (1984), Potrykus et al., Mol. Gen. Genet., 199: 
169 (1985), Reich et al., Biotechnology , 4: 1001 (1986), and Klein et al., Nature , 327: 70 (1987). 
In each case the transformed cells are regenerated to whole plants using standard techniques 
known in the art. 

Agrobacterium-mcdiatcd transformation is a preferred technique for transformation of 
dicotyledons because of its high efficiency of transformation and its broad utility with many 
different species. Agrobacterium transformation typically involves the transfer of the binary 
vector carrying the foreign DNA of interest (e.g. pCEB200 or pCEB2001) to an appropriate 
Agrobacterium strain which may depend on the complement of vir genes carried by the host 
Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g., strain CIB542 
for pCIB200 and pCIB2001 (Uknes et al., Plant Cell 5: 159 (1993)). The transfer of the 
recombinant binary vector to Agrobacterium is accomplished by a triparental mating procedure 
using E. coli carrying the recombinant binary vector, a helper E. coli strain which carries a 
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plasmid such as pRK2013 and which is able to mobilize the recombinant binary vector to the 
target Agrobacterium strain. Alternatively, the recombinant binary vector can be transferred to 
Agrobacterium by DNA transformation (Hofgen & Willmitzer, Nucl. Acids Res. , 16: 9877 
(1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually involves 
co-cultivation of the Agrobacterium with explants from the plant and follows protocols well 
known in the art. Transformed tissue is regenerated on selectable medium carrying the antibiotic 
or herbicide resistance marker present between the binary plasmid T-DNA borders. 

The vectors may be introduced to plant cells in known ways. Preferred cells for 
transformation include Agrobacterium, monocot cells and dicots cells, including Liliopsida cells 
and Panicoideae cells. Preferred monocot cells are cereal cells, e.g., maize (corn), barley, and 
wheat, and starch accumulating dicot cells, e.g., potato. 

Another approach to transforming a plant cell with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. Patent 
Nos. 4,945,050, 5,036,006, and 5,100,792. Generally, this procedure involves propelling inert or 
biologically active particles at the cells under conditions effective to penetrate the outer surface 
of the cell and afford incorporation within the interior thereof. When inert particles are utilized, 
the vector can be introduced into the cell by coating the particles with the vector containing the 
desired gene. Alternatively, the target cell can be surrounded by the vector so that the vector is 
carried into the cell by the wake of the particle. Biologically active particles (e.g., dried yeast 
cells, dried bacterium or a bacteriophage, each containing DNA sought to be introduced) can also 
be propelled into plant cell tissue. 

b. Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. Preferred 

techniques include direct gene transfer into protoplasts using polyethylene glycol (PEG) or 

electroporation techniques, and particle bombardment into callus tissue. Transformations can be 

undertaken with a single DNA species or multiple DNA species (i.e., co-transformation) and 

both these techniques are suitable for use with this invention. Co-transformation may have the 

advantage of avoiding complete vector construction and of generating transgenic plants with 

unlinked loci for the gene of interest and the selectable marker, enabling the removal of the 
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selectable marker in subsequent generations, should this be regarded desirable. However, a 
disadvantage of the use of co-transformation is the less than 100% frequency with which 
separate DNA species are integrated into the genome (Schocher et al., Biotechnology , 4: 1093 
1986)). 

Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe techniques 
for the preparation of callus and protoplasts from an elite inbred line of maize, transformation of 
protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed 
protoplasts. Gordon-Kamm et al. ( Plant Cell 2: 603 (1990)) and Fromm et al. (Biotechnology, 
8: 833 (1990)) have published techniques for transformation of A188-derived maize line using 
particle bombardment. Furthermore, WO 93/07278 and Koziel et al. (Biotechnology, H: 194 
(1993)) describe techniques for the transformation of elite inbred lines of maize by particle 
bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length excised 
from a maize ear 14-15 days after pollination and a PDS-lOOOHe Biolistics device for 
bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques utilizing 
protoplasts or particle bombardment. Protoplast-mediated transformation has been described for 
Japonica-types and Indica-types (Zhang et al., Plant Cell Rep , 7: 379 (1988); Shimamoto et al., 
Nature , 338: 274 (1989); Datta et al., Biotechnology , 8: 736 (1990)). Both types are also 
routinely transformable using particle bombardment (Christou et al., Biotechnology , 9: 957 
(1991)). Furthermore, WO 93/21335 describes techniques for the transformation of rice via 
electroporation. Patent Application EP 0 332 581 describes techniques for the generation, 
transformation and regeneration of Pooideae protoplasts. These techniques allow the 
transformation of Dactylis and wheat. Furthermore, wheat transformation has been described by 
Vasil et al. ( Biotechnology , 10: 667 (1992)) using particle bombardment into cells of type C 
long-term regenerable callus, and also by Vasil et al. ( Biotechnology , U : 1553 (1993)) and 
Weeks et al. ( Plant Physiol. , 102 : 1077 (1993)) using particle bombardment of immature 
embryos and immature embryo-derived callus. A preferred technique for wheat transformation, 
however, involves the transformation of wheat by particle bombardment of immature embryos 
and includes either a high sucrose or a high maltose step prior to gene delivery. Prior to 
bombardment, any number of embryos (0.75-1 mm in length) are plated onto MS medium with 
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3% sucrose (Murashiga & Skoog, Phvsiologia Plantarum , 15: 473 (1962)) and 3 mg/1 2,4-D for 
induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day of 
bombardment, embryos are removed from the induction medium and placed onto the osmoticum 
(i.e., induction medium with sucrose or maltose added at the desired concentration, typically 
15%). The embryos are allowed to plasmolyze for 2-3 hours and are then bombarded. Twenty 
embryos per target plate is typical, although not critical. An appropriate gene-carrying plasmid 
(such as pCEB3064 or pSG35) is precipitated onto micrometer size gold particles using standard 
procedures. Each plate of embryos is shot with the DuPont Biolistics® helium device using a 
burst pressure of about 1000 psi using a standard 80 mesh screen. After bombardment, the 
embryos are placed back into the dark to recover for about 24 hours (still on osmoticum). After 
24 hours, the embryos are removed from the osmoticum and placed back onto induction medium 
where they stay for about a month before regeneration. Approximately one month later the 
embryo explants with developing embryogenic callus are transferred to regeneration medium 
(MS + 1 mg/liter NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 
mg/1 basta in the case of pCEB3064 and 2 mg/1 methotrexate in the case of pSOG35). After 
approximately one month, developed shoots are transferred to larger sterile containers known as 
M GA7s" which contain half-strength MS, 2% sucrose, and the same concentration of selection 
agent. 

Transformation of monocotyledons using Agrobacterium has also been described. See, 
WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein by 

reference. 

c. Transformation of Plastids 

Seeds of Nicotiana tabacum c.v. 'Xanthi nc' are germinated seven per plate in a 1" 
circular array on T agar medium and bombarded 12-14 days after sowing with 1 Jim tungsten 
particles (M10, Biorad, Hercules, CA) coated with DNA from plasmids pPH143 and pPH145 
essentially as described (Svab and Maliga, PNAS, 90:913 (1993)). Bombarded seedlings are 
incubated on T medium for two days after which leaves are excised and placed abaxial side up in 
bright light (350-500 |imol photons/m 2 /s) on plates of RMOP medium (Svab, Hajdukiewicz and 
Maliga, PNAS> 87:8526 (1990)) containing 500 fig/ml spectinomycin dihydrochloride (Sigma, 

St. Louis, MO). Resistant shoots appearing underneath the bleached leaves three to eight weeks 
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after bombardment are subcloned onto the same selective medium, allowed to form callus, and 
secondary shoots isolated and subcloned. Complete segregation of transformed plastid genome 
copies (homoplasmicity) in independent subclones is assessed by standard techniques of 
Southern blotting (Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor (1989)). BamHI/EcoRI-digested total cellular DNA 
(Mettler, I. J. Plant Mol Biol Reporter , 5:346 (1987)) is separated on 1% Tris-borate (TBE) 

32 

agarose gels, transferred to nylon membranes (Amersham) and probed with P-labeled random 
primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA fragment from pC8 
containing a portion of the rps7/12 plastid targeting sequence. Homoplasmic shoots are rooted 
aseptically on spectinomycin-containing MS/IBA medium (McBride et al., PNAS, £1:7301 
(1994)) and transferred to the greenhouse. 

Production and Characterization of Stably Transformed Plants 

Transformed plant cells are then placed in an appropriate selective medium for selection 
of transgenic cells, which are then grown to callus. Shoots are grown from callus and plantlets 
generated from the shoot by growing in rooting medium. The various constructs normally will 
be joined to a marker for selection in plant cells. Conveniently, the marker may be resistance to 
a biocide (particularly an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, 
chloramphenicol, herbicide, or the like). The particular marker used will allow for selection of 
transformed cells as compared to cells lacking the DNA which has been introduced. 
Components of DNA constructs, including transcription/expression cassettes of this invention, 
may be prepared from sequences, which are native (endogenous) or foreign (exogenous) to the 
host. By "foreign" it is meant that the sequence is not found in the wild-type host into which the 
construct is introduced. Heterologous constructs will contain at least one region, which is not 
native to the gene from which the transcription-initiation-region is derived. 

To confirm the presence of the transgenes in transgenic cells and plants, a Southern blot 
analysis can be performed using methods known to those skilled in the art. Integration of a 
polynucleic acid segment into the genome can be detected and quantitated by Southern blot, 
since they can be readily distinguished from constructs containing the segments through use of 
appropriate restriction enzymes. Expression products of the transgenes can be detected in any of 
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a variety of ways, depending upon the nature of the product, and include Western blot and 
enzyme assay. One particularly useful way to quantitate protein expression and to detect 
replication in different plant tissues is to use a reporter gene, such as GUS. Once transgenic 
plants have been obtained, they may be grown to produce plant tissues or parts having the 
desired phenotype. The plant tissue or plant parts may be harvested, and/or the seed collected. 
The seed may serve as a source for growing additional plants with tissues or parts having the 
desired characteristics. 

The invention thus provides a transformed plant or plant part, such as an ear, seed, fruit, 
grain, stover, chaff, or bagasse comprising at least one polynucleotide, expression cassette or 
vector of the invention, methods of making such a plant and methods of using such a plant or a 
part thereof. The transformed plant or plant part expresses a processing enzyme, optionally 
localized in a particular cellular or subcellular compartment of a certain tissue or in developing 
grain. For instance, the invention provides a transformed plant part comprising at least one 
starch processing enzyme present in the cells of the plant, wherein the plant part is obtained from 
a transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one starch processing enzyme. The processing enzyme does not act on the target 
substrate unless activated by methods such as heating, grinding, or other methods, which allow 
the enzyme to contact the substrate under conditions where the enzyme is active 

Exemplary Methods of the Present Invention 

The self-processing plants and plant parts of the present invention may be used in various 
methods employing the processing enzymes (mesophilic, thermophilic, or hyperthermophilic) 
expressed and activated therein. In accordance with the present invention, a transgenic plant part 
obtained from a transgenic plant the genome of which is augmented with at least one processing 
enzyme, is placed under conditions in which the processing enzyme is expressed and activated. 
Upon activation, the processing enzyme is activated and functions to act on the substrate in 
which it normally acts to obtained the desired result. For example, the starch-processing 
enzymes act upon starch to degrade, hydrolyze, isomerize, or otherwise modify to obtain the 
desired result upon activation. Non-starch processing enzymes may be used to disrupt the plant 
cell membrane in order to facilitate the extraction of starch, lipids, amino acids, or other products 
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from the plants. Moreover, non-hyperthermophilic and hyperthermophilic enzymes may be used 
in combination in the self-processing plant or plant parts of the present invention. For example, 
a mesophilic non-starch degrading enzyme may be activated to disrupt the plant cell membrane 
for starch extraction, and subsequently, a hyperthermophilic starch-degrading enzyme may then 
be activated in the self-processing plant to degrade the starch. 

Enzymes expressed in grain can be activated by placing the plant or plant part containing 
them in conditions in which their activity is promoted. For example, one or more of the 
following techniques may be used: The plant part may be contacted with water, which provides 
a substrate for a hydrolytic enzyme and thus will activate the enzyme. The plant part may be 
contacted with water which will allow enzyme to migrate from the compartment into which it 
was deposited during development of the plant part and thus to associate with its substrate. 
Movement of the enzyme is possible because compartmentalization is breached during 
maturation, drying of grain and re-hydration. The intact or cracked grain may be contacted with 
water which will allow enzyme to migrate from the compartment into which it was deposited 
during development of the plant part and thus to associate with its substrate. Enzymes can also 
be activated by addition of an activating compound. For example, a calcium-dependent enzyme 
can be activated by addition of calcium. Other activating compounds may determined by those 
skilled in the art. Enzymes can be activated by removal of an inactivator. For example, there are 
known peptide inhibitors of amylase enzymes, the amylase could be co-expressed with an 
amylase inhibitor and then activated by addition of a protease. Enzymes can be activated by 
alteration of pH to one at which the enzyme is most active. Enzymes can also be activated by 
increasing temperature. An enzyme generally increases in activity up to the maximal 
temperature for that enzyme. A mesophilic enzyme will increase in activity from the level of 
activity ambient temperature up to the temperature at which it loses activity which is typically 
less than or equal to 70 °C. Similarly thermophilic and hyperthermophilic enzymes can also be 
activated by increasing temperature. Thermophilic enzymes can be activated by heating to 
temperatures up to the maximal temperature of activity or of stability. For a thermophilic 
enzyme the maximal temperatures of stability and activity will generally be between 70 and 85 
°C. Hyperthermophilic enzymes will have the even greater relative activation than mesophilic or 
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thermophilic enzymes because of the greater potential change in temperature from 25 °C up to 
85 °C to 95 °C or even 100 °C. The increased temperature may be achieved by any method, for 
example by heating such as by baking, boiling, heating, steaming, electrical discharge or any 
combination thereof. Moreover, in plants expressing mesophilic or thermophilic enzyme(s), 
activation of the enzyme may be accomplished by grinding, thereby allowing the enzyme to 
contact the substrate. 

The optimal conditions, e.g., temperature, hydration, pH, etc, may be determined by one 
having skill in the art and may depend upon the individual enzyme being employed and the 
desired application of the enzyme. 

The present invention further provides for the use of exogenous enzymes that may assist 
in a particular process. For example, the use of a self-processing plant or plant part of the 
present invention may be used in combination with an exogenously provided enzyme to facilitate 
the reaction. As an example, transgenic or-amylase corn may be used in combination with other 
starch-processing enzymes, such as pullulanase, a-glucosidase, glucose isomerase, mannanases, 
hemicellulases, etc., to hydrolyze starch or produce ethanol. In fact, it has been found that 
combinations of the transgenic a-amylase corn with such enzymes has unexpectedly provided 
superior degrees of starch conversion relative to the use of transgenic a- amylase corn alone. 

Example of suitable methods contemplated herein are provided. 

a. Starch Extraction From Plants 

The invention provides for a method of facilitating the extraction of starch from plants. 
In particular, at least one polynucleotide encoding a processing enzyme that disrupts the 
physically restraining matrix of the endosperm (cell walls, non-starch polysaccharide, and 
protein matrix) is introduced to a plant so that the enzyme is preferably in close physical 
proximity to starch granules in the plant. In this embodiment of the invention, transformed 
plants express one or more protease, glucanase, xylanase, thioredoxin/thioredoxin reductase, 
cellulase, phytase, lipase, beta glucosidase, esterase and the like, but not enzymes that have any 
starch degrading activity, so as to maintain the integrity of the starch granules. The expression of 
these enzymes in a plant part such as grain thus improves the process characteristics of grain. 
The processing enzyme may be mesophilic, thermophilic, or hyperthermophilic. In one example, 
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grain from a transformed plant of the invention is heat dried, likely inactivating non- 
hyperthermophilic processing enzymes and improving seed integrity. Grain (or cracked grain) is 
steeped at low temperatures or high temperatures (where time is of the essence) with high or low 
moisture content or conditions (see Primary Cereal Processing, Gordon and Willm, eds., pp. 319- 
337 (1994), the disclosure of which is incorporated herein), with or without sulphur dioxide. 
Upon reaching elevated temperatures, optionally at certain moisture conditions, the integrity of 
the endosperm matrix is disrupted by activating the enzymes, e.g., proteases, xylanases, phytase 
or glucanases which degrade the proteins and non-starch polysaccharides present in the 
endosperm leaving the starch granule therein intact and more readily recoverable from the 
resulting material. Further, the proteins and non-starch polysaccharides in the effluent are at 
least partially degraded and highly concentrated, and so may be used for improved animal feed, 
food, or as media components for the fermentation of microorganisms. The effluent is 
considered a corn-steep liquor with improved composition. 

Thus, the invention provides a method to prepare starch granules. The method comprises 
treating grain, for example cracked grain, which comprises at least one non-starch processing 
enzyme under conditions which activate the at least one enzyme, yielding a mixture comprising 
starch granules and non-starch degradation products, e.g., digested endosperm matrix products. 
The non-starch processing enzyme may be mesophilic, thermophilic, or hyperthermophilic. 
After activation of the enzyme, the starch granules are separated from the mixture. The grain is 
obtained from a transformed plant, the genome of which comprises (is augmented with) an 
expression cassette encoding the at least one processing enzyme. For example, the processing 
enzyme may be a protease, glucanase, xylanase, phytase, thiroredoxin/thioredoxin reductase, 
esterase cellulase, lipase, or a beta glucosidase. The processing enzyme may be 
hyperthermophilic. The grain can be treated under low or high moisture conditions, in the 
presence or absence of sulfur dioxide. Depending on the activity and expression level of the 
processing enzyme in the grain from the transgenic plant, the transgenic grain may be mixed 
with commodity grain prior to or during processing. Also provided are products obtained by the 
method such as starch, non-starch products and improved steepwater comprising at least one 

additional component. 

b. Starch-Processing Methods 
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Transformed plants or plant parts of the present invention may comprise starch-degrading 
enzymes as disclosed herein that degrade starch granules to dextrins, other modified starches, or 
hexoses (e.g., a-amylase, pullulanase, a-glucosidase, glucoamylase, amylopullulanase) or 
convert glucose into fructose (e.g., glucose isomerase). Preferably, the starch-degrading enzyme 
is selected from a-amylase, a-glucosidase, glucoamylase, pullulanase, neopullulanase, 
amylopullulanase, glucose isomerase, and combinations thereof is used to transform the grain. 
Moreover, preferably, the enzyme is operably linked to a promoter and to a signal sequence that 
targets the enzyme to the starch granule, an amyloplast, the apoplast, or the endoplasmic 
reticulum. Most preferably, the enzyme is expressed in the endosperm, and particularly, corn 
endosperm, and localized to one or more cellular compartments, or within the starch granule 
itself. The preferred plant part is grain. Preferred plant parts are those from corn, wheat, barley, 
rye, oat, sugar cane, or rice. 

In accordance with one starch-degrading method of the present invention, the 
transformed grain accumulates the starch-degrading enzyme in starch granules, is steeped at 
conventional temperatures of 50°C-60°C, and wet-milled as is known in the art. Preferably, the 
starch-degrading enzyme is hyperthermophilic. Because of sub-cellular targeting of the enzyme 
to the starch granule, or by virtue of the association of the enzyme with the starch granule, by 
contacting the enzyme and starch granule during the wet-milling process at the conventional 
temperatures, the processing enzyme is co-purified with the starch granules to obtain the starch 
granules/enzyme mixture. Subsequent to the recovery of the starch granules/enzyme mixture, 
the enzyme is then activated by providing favorable conditions for the activity of the enzyme. 
For example, the processing may be performed in various conditions of moisture and/or 
temperature to facilitate the partial (in order to make derivatized starches or dextrins) or 
complete hydrolysis of the starch into hexoses. Syrups containing high dextrose or fructose 
equivalents are obtained in this manner. This method effectively reduces the time, energy, and 
enzyme costs and the efficiency with which starch is converted to the corresponding hexose, and 
the efficiency of the production of products, like high sugar steepwater and higher dextrose 
equivalent syrups, are increased. 
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In another embodiment, a plant, or a product of the plant such as a fruit or grain, or flour 
made from the grain that expresses the enzyme is treated to activate the enzyme and convert 
polysaccharides expressed and contained within the plant into sugars. Preferably, the enzyme is 
fused to a signal sequence that targets the enzyme to a starch granule, an amyloplast, the apoplast 
or to the endoplasmic reticulum as disclosed herein. The sugar produced may then be isolated or 
recovered from the plant or the product of the plant. In another embodiment, a processing enzyme 
able to convert polysaccharides into sugars is placed under the control of an inducible promoter 
according to methods known in the art and disclosed herein. The processing enzyme may be 
mesophilic, thermophilic or hyperthermophilic. The plant is grown to a desired stage and the 
promoter is induced causing expression of the enzyme and conversion of the polysaccharides, 
within the plant or product of the plant, to sugars. Preferably the enzyme is operably linked to a 
signal sequence that targets the enzyme to a starch granule, an amyloplast, an apoplast or to the 
endoplasmic reticulum. In another embodiment, a transformed plant is produced that expresses a 
processing enzyme able to convert starch into sugar. The enzyme is fused to a signal sequence that 
targets the enzyme to a starch granule within the plant. Starch is then isolated from the 
transformed plant that contains the enzyme expressed by the transformed plant. The enzyme 
contained in the isolated starch may then be activated to convert the starch into sugar. The enzyme 
may be mesophilic, thermophilic, or hyperthermophilic. Examples of hyperthermophilic enzymes 
able to convert starch to sugar are provided herein. The methods may be used with any plant 
which produces a polysaccharide and that can express an enzyme able to convert a polysaccharide 
into sugars or hydrolyzed starch product such as dextrin, maltooligosaccharide, glucose and/or 
mixtures thereof. 

The invention provides a method to produce dextrins and altered starches from a plant, or 

a product from a plant, that has been transformed with a processing enzyme which hydrolyses 

certain covalent bonds of a polysaccharide to form a polysaccharide derivative. In one 

embodiment, a plant, or a product of the plant such as a fruit or grain, or flour made from the 

grain that expresses the enzyme is placed under conditions sufficient to activate the enzyme and 

convert polysaccharides contained within the plant into polysaccharides of reduced molecular 

weight. Preferably, the enzyme is fused to a signal sequence that targets the enzyme to a starch 

granule, an amyloplast, the apoplast or to the endoplasmic reticulum as disclosed herein. The 
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dextrin or derivative starch produced may then be isolated or recovered from the plant or the 
product of the plant. In another embodiment, a processing enzyme able to convert 
polysaccharides into dextrins or altered starches is placed under the control of an inducible 
promoter according to methods known in the art and disclosed herein. The plant is grown to a 
desired stage and the promoter is induced causing expression of the enzyme and conversion of 
the polysaccharides, within the plant or product of the plant, to dextrins or altered starches. 
Preferably the enzyme is ct-amylase, pullulanase, iso or neo-pullulanase and is operably linked to 
a signal sequence that targets the enzyme to a starch granule, an amyloplast, the apoplast or to 
the endoplasmic reticulum. In one embodiment, the enzyme is targeted to the apoplast or to the 
endoreticulum. In yet another embodiment, a transformed plant is produced that expresses an 
enzyme able to convert starch into dextrins or altered starches. The enzyme is fused to a signal 
sequence that targets the enzyme to a starch granule within the plant. Starch is then isolated 
from the transformed plant that contains the enzyme expressed by the transformed plant. The 
enzyme contained in the isolated starch may then be activated under conditions sufficient for 
activation to convert the starch into dextrins or altered starches. Examples of hyperthermophilic 
enzymes, for example, able to convert starch to hydrolyzed starch products are provided herein. 
The methods may be used with any plant which produces a polysaccharide and that can express 
an enzyme able to convert a polysaccharide into sugar. 

In another embodiment, grain from transformed plants of the invention that accumulate 
starch-degrading enzymes that degrade linkages in starch granules to dextrins, modified starches 
or hexose (e.g., ct-amylase, pullulanase, a-glucosidase, glucoamylase, amylopullulanase) is 
steeped under conditions favoring the activity of the starch degrading enzyme for various periods 
of time. The resulting mixture may contain high levels of the starch-derived product. The use of 
such grain: 1) eliminates the need to mill the grain, or otherwise process the grain to first obtain 
starch granules, 2) makes the starch more accessible to enzymes by virtue of placing the enzymes 
directly within the endosperm tissue of the grain, and 3) eliminates the need for microbially 
produced starch-hydrolyzing enzymes. Thus, the entire process of wet-milling prior to hexose 
recovery is eliminated by simply heating grain, preferably corn grain, in the presence of water to 
allow the enzymes to act on the starch. 
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This process can also be employed for the production of ethanol, high fructose syrups, 
hexose (glucose) containing fermentation media, or any other use of starch that does not require 
the refinement of grain components. 

The invention further provides a method of preparing dextrin, maltooligosaccharides, 
and/or sugar involving treating a plant part comprising starch granules and at least one starch 
processing enzyme under conditions so as to activate the at least one enzyme thereby digesting 
starch granules to form an aqueous solution comprising sugars. The plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one processing enzyme. The aqueous solution comprising dextrins, 
maltooligosaccharides, and/or sugar is then collected. In one embodiment, the processing 
enzyme is a-amylase, a-glucosidase, pullulanase, glucoamylase, amylopullulanase, glucose 
isomerase, or any combination thereof. Preferably, the enzyme is hyperthermophilic. In another 
embodiment, the method further comprises isolating the dextrins, maltooligosaccharides, and/or 
sugar. 

c. Improved Corn Varieties 

The invention also provides for the production of improved corn varieties (and 
varieties of other crops) that have normal levels of starch accumulation, and accumulate 
sufficient levels of amylolytic enzyme(s) in their endosperm, or starch accumulating organ, such 
that upon activation of the enzyme contained therein, such as by boiling or heating the plant or a 
part thereof in the case of a hyperthermophilic enzyme, the enzyme(s) is activated and facilitates 
the rapid conversion of the starch into simple sugars. These simple sugars (primarily glucose) 
will provide sweetness to the treated corn. The resulting corn plant is an improved variety for 
dual use as a grain 

producing hybrid and as sweet corn. Thus, the invention provides a method to produce 
hyper-sweet corn, comprising treating transformed corn or a part thereof, the genome of 
which is augmented with and expresses in endosperm an expression cassette comprising a 
promoter operably linked to a first polynucleotide encoding at least one amylolytic enzyme, 
conditions which activate the at least one enzyme so as to convert polysaccharides in the corn 
into sugar, yielding hypersweet corn. The promoter may be a constitutive promoter, a seed- 
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specific promoter, or an endosperm-specific promoter which is linked to a polynucleotide 
sequence which encodes a processing enzyme such 

as a-amylase, e.g., one comprising SEQ ID NO: 13, 14, or 16. Preferably, the enzyme is 
hyperthermophilic. In one embodiment, the expression cassette further comprises a second 
polynucleotide which encodes a signal sequence operably linked to the enzyme encoded by the 
first polynucleotide. Exemplary signal sequences in this embodiment of the invention direct the 
enzyme to apoplast, the endoplasmic reticulum, a starch granule, or to an amyloplast. The corn 
plant is grown such that the ears with kernels are formed and then the promoter is induced to 
cause the enzyme to be expressed and convert polysaccharide contained within the plant into 
sugar. 

d. Self-Fermenting Plants 

In another embodiment of the invention, plants, such as corn, rice, wheat, or sugar 
cane are engineered to accumulate large quantities of processing enzymes in their cell walls, e.g., 
xylanases, cellulases, hemicellulases, glucanases, pectinases, lipases, esterases, beta 
glucosidases, phytases, proteases and the like (non-starch polysaccharide degrading enzymes). 
Following the harvesting of the grain component (or sugar in the case of sugar cane), the stover, 
chaff, or bagasse is used as a source of the enzyme, which was targeted for expression and 
accumulation in the cell walls, and as a source of biomass. The stover (or other left-over tissue) 
is used as a feedstock in a process to recover fermentable sugars. The process of obtaining the 
fermentable sugars consists of activating the non-starch polysaccharide degrading enzyme. For 
example, activation may comprise heating the plant tissue in the presence of water for periods of 
time adequate for the hydrolysis of the non-starch polysaccharide into the resulting sugars. Thus, 
this self-processing stover produces the enzymes required for conversion of polysaccharides into 
monosaccharides, essentially at no incremental cost as they are a component of the feedstock. 
Further, the temperature-dependent enzymes have no detrimental effects on plant growth and 
development, and cell wall targeting, even targeting into polysaccharide microfibrils by virtue of 
cellulose/xylose binding domains fused to the protein, improves the accessibility of the substrate 
to the enzyme. 

Thus, the invention also provides a method of using a transformed plant part comprising 

at least one non-starch polysaccharide processing enzyme in the cell wall of the cells of the plant 
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part. The method comprises treating a transformed plant part comprising at least one non-starch 
polysaccharide processing enzyme under conditions which activate the at least one enzyme 
thereby digesting starch granules to form an aqueous solution comprising sugars, wherein the 
plant part is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one non-starch polysaccharide processing enzyme; and 
collecting the aqueous solution comprising the sugars. The invention also includes a transformed 
plant or plant part comprising at least one non-starch polysaccharide processing enzyme present 
in the cell or cell wall of the cells of the plant or plant part. The plant part is obtained from a 
transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one non-starch processing enzyme, e.g., a xylanase, cellulase, glucanase, pectinase, 
lipase, esterase, beta glucosidase, phytase, protease or any combination thereof 
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e. Aqueous Phase High In Protein and Sugar Content 

In yet another embodiment, proteases and lipases are engineered to accumulate in seeds, 
e.g., soybean seeds. After activation of the protease or lipase, such as, for example, by heating, 
these enzymes in the seeds hydrolyze the lipid and storage proteins present in soybeans during 
processing. Soluble products comprising amino acids, which can be used as feed, food or 
fermentation media, and fatty acids, can thus be obtained. Polysaccharides are typically found in 
the insoluble fraction of processed grain. However, by combining polysaccharide degrading 
enzyme expression and accumulation in seeds, proteins and polysaccharides can be hydrolyzed 
and are found in the aqueous phase. For example, zeins from corn and storage protein and non- 
starch polysaccharides from soybean can be solubilized in this manner. Components of the 
aqueous and hydrophobic phases can be easily separated by extraction with organic solvent or 
supercritical carbon dioxide. Thus, what is provided is a method for producing an aqueous 
extract of grain that contains higher levels of protein, amino acids, sugars or saccharides. 

f. Self-Processing Fermentation 

The invention provides a method to produce ethanol, a fermented beverage, or other 
fermentation-derived product(s). The method involves obtaining a plant, or the product or part of a 
plant, or plant derivative such as grain flour, wherein a processing enzyme that converts 
polysaccharides into sugar is expressed. The plant, or product thereof, is treated such that sugar is 
produced by conversion of the polysaccharide as described above. The sugars and other 
components of the plant are then fermented to form ethanol or a fermented beverage, or other 
fermentation-derived products, according to methods known in the art. See, for example, U.S. 
Patent No.: 4,929,452. Briefly the sugar produced by conversion of polysaccharides is incubated 
with yeast under conditions that promote conversion of the sugar into ethanol. A suitable yeast 
includes high alcohol-tolerant and high-sugar tolerant strains of yeast, such as, for example, the 
yeast, S. cerevisiae ATCC No. 20867. This strain was deposited with the American Type Culture 
Collection, Rockville, MD, on Sept. 17, 1987 and assigned ATCC No. 20867. The fermented 
product or fermented beverage may then be distilled to isolate ethanol or a distilled beverage, or 
the fermentation product otherwise recovered. The plant used in this method may be any plant 
that contains a polysaccharide and is able to express an enzyme of the invention. Many such 
plants are disclosed herein. Preferably the plant is one that is grown commercially. More 
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preferably the plant is one that is normally used to produce ethanol or fermented beverages, or 
fermented products, such as, for example, wheat, barley, corn, rye, potato, grapes or rice. 

The method comprises treating a plant part comprising at least one polysaccharide 
processing enzyme under conditions to activate the at least one enzyme thereby digesting 
polysaccharide in the plant part to form fermentable sugar. The polysaccharide processing 
enzyme may be mesophilic, thermophilic, or hyperthermophilic. The plant part is obtained from 
a transformed plant, the genome of which is augmented with an expression cassette encoding the 
at least one polysaccharide processing enzyme. Plant parts for this embodiment of the invention 
include, but are not limited to, grain, fruit, seed, stalk, wood, vegetable or root. Plants include 
but are not limited to oat, barley, wheat, berry, grape, rye, corn, rice, potato, sugar beet, sugar 
cane, pineapple, grass and tree. The plant part may be combined with commodity grain or other 
commercially available substrates; the source of the substrate for processing may be a source 
other than the self-processing plant. The fermentable sugar is then incubated under conditions 
that promote the conversion of the fermentable sugar into ethanol, e.g., with yeast and/or other 
microbes. In an embodiment, the plant part is derived from corn transformed with a-amylase, 
which has been found to reduce the amount of time and cost of fermentation. 

It has been found that the amount of residual starch is reduced when transgenic corn 
made in accordance with the present invention expressing a thermostable a-amylase, for 
example, is used in fermentation. This indicates that more starch is solubilized during 
fermentation. The reduced amount of residual starch results in the distillers' grains having 
higher protein content by weight and higher value. Moreover, the fermentation of the transgenic 
corn of the present invention allows the liquefaction process to be performed at a lower pH, 
resulting in savings in the cost of chemicals used to adjust the pH, at a higher temperature, e.g., 
greater than 85°C, preferably, greater than 90°C, more preferably, 95°C or higher, resulting in 
shorter liquefaction times and more complete solubilization of starch, and reduction of 
liquefaction times, all resulting in efficient fermentation reactions with higher yields of ethanol. 

Moreover, it has been found that contacting conventional plant parts with even a small 

portion of the transgenic plant made in accordance with the present invention may reduce the 

fermentation time and costs associated therewith. As such, the present invention relates to the 

reduction in the fermentation time for plants comprising subjecting a transgenic plant part from a 
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plant comprising a polysaccharide processing enzyme that converts polysaccharides into sugar 
relative to the use of a plant part not comprising the polysaccharide processing enzyme. 

g. Raw Starch Processing Enzymes And Polynucleotides Encoding Them 
A polynucleotide encoding a mesophilic processing enzyme(s) is introduced into a plant 
or plant part. In an embodiment, the polynucleotide of the present invention is a maize- 
optimized polynucleotide such as provided in SEQ ED NOs: 48, 50, and 59, encoding a 
glucoamylase, such as provided in SEQ ID NOs: 47, and 49. In another embodiment, the 
polynucleotide of the present invention is a maize-optimized polynucleotide such as provided in 
SEQ ID NO: 52, encoding an alpha-amylase, such as provided in SEQ ID NO: 51. Moreover, 
fusion products of processing enzymes are further contemplated. In one embodiment, the 
polynucleotide of the present invention is a maize-optimized polynucleotide such as provided in 
SEQ ID NO: 46, encoding an alpha-amylase and glucoamylase fusion, such as provided in SEQ 
ID NO: 45. Combinations of processing enzymes are further envisioned by the present 
invention. For example, a combination of starch-processing enzymes and non-starch processing 
enzymes is contemplated herein. Such combinations of processing enzymes may be obtained by 
employing the use of multiple gene constructs encoding each of the enzymes. Alternatively, the 
individual transgenic plants stably transformed with the enzymes may be crossed by known 
methods to obtain a plant containing both enzymes. Another method includes the use of 
exogenous enzyme(s) with the transgenic plant. 

The source of the starch-processing and non-starch processing enzymes may be isolated 
or derived from any source and the polynucleotides corresponding thereto may be ascertained by 
one having skill in the art. The or-amylase may be derived from Aspergillus (e.g., Aspergillus 
shirousami and Aspergillus niger), Rhizopus (eg., Rhizopus oryzae), and plants such as corn, 
barley, and rice. The glucoamylase may be derived from Aspergillus (e.g., Aspergillus 
shirousami and Aspergillus niger) p Rhizopus (eg., Rhizopus oryzae), and Thermoanaerobacter 
(eg., Thermoanaerobacter thermosaccharolyticum). 

In another embodiment of the invention, the polynucleotide encodes a mesophilic starch- 
processing enzyme that is operably linked to a maize-optimized polynucleotide such as provided 
in SEQ ID NO: 54, encoding a raw starch binding domain, such as provided in SEQ ID NO: 53. 
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In another embodiment, a tissue-specific promoter includes the endosperm-specific 
promoters such as the maize 7-zein promoter (exemplified by SEQ ID NO: 12) or the maize 
ADP-gpp promoter (exemplified by SEQ ID NO:l 1, which includes a 5' untranslated and an 
intron sequence) or a Q protein promoter (exemplified by SEQ ID NO: 98) or a rice glutelin 
promoter (exemplified by SEQ ID NO: 67) . Thus, the present invention includes an isolated 
polynucleotide comprising a promoter comprising SEQ ID NO: 11, 12, 67, or 98, a 
polynucleotide which hybridizes to the complement thereof under low stringency hybridization 
conditions, or a fragment thereof which has promoter activity, e.g., at least 10%, and preferably 
at least 50%, the activity of a promoter having SEQ ID NO:l 1, 12, 67 or 98. 

In one embodiment, the product from a starch-hydrolysis gene, such as a-amylase, 
glucoamylase, or ot-amylase/glucoamylase fusion may be targeted to a particular organelle or 
location such as the endoplasmic reticulum or apoplast, rather than to the cytoplasm. This is 
exemplified by the use of the maize y-zein N-terminal signal sequence (SEQ ID NO: 17), which 
confers apoplast-specific targeting of proteins, and the use of the y-zein N-terminal signal 
sequence (SEQ ID NO: 17) which is operably linked to the processing enzyme that is operably 
linked to the sequence SEKDEL for retention in the endoplasmic reticulum. Directing the 
protein or enzyme to a specific compartment will allow the enzyme to be localized in a manner 
that it will not come into contact with the substrate. In this manner the enzymatic action of the 
enzyme will not occur until the enzyme contacts its substrate. The enzyme can be contacted with 
its substrate by the process of milling (physical disruption of the cell integrity) and hydrating. 
For example, a mesophilic starch-hydrolyzing enzyme can be targeted to the apoplast or to the 
endoplasmic reticulum and will therefore not come into contact with starch granules in the 
amyloplast. Milling of the grain will disrupt the integrity of the grain and the starch hydrolyzing 
enzyme will then contact the starch granules. In this manner the potential negative effects of co- 
localization of an enzyme and its substrate can be circumvented. 

h. Food Products Without Added Sweetener 

Also provided is a method to produce a sweetened farinaceous food product 
without adding additional sweetener. Examples of farinaceous products include, but are not 
limited to, breakfast food, ready to eat food, baked food, pasta and cereal products such as 
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breakfast cereal. The method comprises treating a plant part comprising at least one starch 
processing enzyme under conditions which activate the starch processing enzyme, thereby 
processing starch granules in the plant part to sugars so as to form a sweetened product, e.g., 
relative to the product produced by processing starch granules from a plant part which does not 
comprise the hyperthermophilic enzyme. Preferably, the starch processing enzyme is 
hyperthermophilic and is activated by heating, such as by baking, boiling, heating, steaming, 
electrical discharge, or any combination thereof. The plant part is obtained from a transformed 
plant, for instance from transformed soybean, rye, oat, barley, wheat, corn, rice or sugar cane, the 
genome of which is augmented with an expression cassette encoding the at least one 
hyperthermophilic starch processing enzyme, e.g., a-amylase, a-glucosidase, glucoamylase, 
pullulanase, glucose isomerase, or any combination thereof. The sweetened product is then 
processed into a farinaceous food product. The invention also provides a farinaceous food 
product, e.g., a cereal food, a breakfast food, a ready to eat food, or a baked food, produced by 
the method. The farinaceous food product may be formed from the sweetened product and 
water, and may contain malt, flavorings, vitamins, minerals, coloring agents or any combination 
thereof. 

The enzyme may be activated to convert polysaccharides contained within the plant 
material into sugar prior to inclusion of the plant material into the cereal product or during the 
processing of the cereal product. Accordingly, polysaccharides contained within the plant 
material may be converted into sugar by activating the material, such as by heating in the case of 
a hyperthermophilic enzyme, prior to inclusion in the farinaceous product. The plant material 
containing sugar produced by conversion of the polysaccharides is then added to the product to 
produce a sweetened product. Alternatively, the polysaccharides may be converted into sugars 
by the enzyme during the processing of the farinaceous product. Examples of processes used to 
make cereal products are well known in the art and include heating, baking, boiling and the like 
as described in U.S. Patent Nos.: 6,183,788; 6,159,530; 6,149,965; 4,988,521 and 5,368,870. 

Briefly, dough may be prepared by blending various dry ingredients together with water 
and cooking to gelatinize the starchy components and to develop a cooked flavor. The cooked 
material can then be mechanically worked to form a cooked dough, such as cereal dough. The 
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dry ingredients may include various additives such as sugars, starch, salt, vitamins, minerals, 
colorings, flavorings, salt and the like. In addition to water, various liquid ingredients such as 
corn (maize) or malt syrup can be added. The farinaceous material may include cereal grains, 
cut grains, grits or flours from wheat, rice, corn, oats, barley, rye, or other cereal grains and 
mixtures thereof from that a transformed plant of the invention. The dough may then be 
processed into a desired shape through a process such as extrusion or stamping and further 
cooked using means such as a James cooker, an oven or an electrical discharge device. 

Further provided is a method to sweeten a starch containing product without 
adding sweetener. The method comprises treating starch comprising at least one 
starch processing enzyme conditions to activate the at least one enzyme thereby digesting the 
starch to form a sugar thereby forming a treated (sweetened) starch, e.g., relative to the product 
produced by treating starch which does not comprise the hyperthermophilic enzyme. The starch 
of the invention is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one processing enzyme. Enzymes include cx-amylase, 
a-glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
enzyme may be hyperthermophilic and activated with heat. Preferred transformed plants include 
corn, soybean, rye, oat, barley, wheat, rice and sugar cane. The treated starch is then added to a 
product to produce a sweetened starch containing product, e.g., a farinaceous food product. Also 
provided is a sweetened starch containing product produced by the method. 

The invention further provides a method to sweeten a polysaccharide containing 
fruit or vegetable comprising: treating a fruit or vegetable comprising at least one 
polysaccharide processing enzyme under conditions which activate the at least one enzyme, 
thereby processing the polysaccharide in the fruit or vegetable to form sugar, yielding a 
sweetened fruit or vegetable, e.g., relative to a fruit or vegetable from a plant which does not 
comprise the polysaccharide processing enzyme. The fruit or vegetable of the invention is 
obtained from a transformed plant, the genome of which is augmented with an expression 
cassette encoding the at least one polysaccharide processing enzyme. 
Fruits and vegetables include potato, tomato, banana, squash, pea, and bean. 
Enzymes include oc-amylase, a-glucosidase, glucoamylase, pullulanase, glucose 
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isomerase, or any combination thereof. The enzyme may be hyperthermophilic. 
i. Sweetening a polysaccharide containing plant or plant product 

The method involves obtaining a plant that expresses a polysaccharide processing 
enzyme which converts a polysaccharide into a sugar as described above. Accordingly the 
enzyme is expressed in the plant and in the products of the plant, such as in a fruit or vegetable. 
In one embodiment, the enzyme is placed under the control of an inducible promoter such that 
expression of the enzyme may be induced by an external stimulus. Such inducible promoters 
and constructs are well known in the art and are described herein. Expression of the enzyme 
within the plant or product thereof causes polysaccharide contained within the plant or product 
thereof to be converted into sugar and to sweeten the plant or product thereof. In another 
embodiment, the polysaccharide processing enzyme is constitutively expressed. Thus, the plant 
or product thereof may be activated under conditions sufficient to activate the enzyme to convert 
the polysaccharides into sugar through the action of the enzyme to sweeten the plant or product 
thereof. As a result, this self-processing of the polysaccharide in the fruit or vegetable to form 
sugar yields a sweetened fruit or vegetable, e.g., relative to a fruit or vegetable from a plant 
which does not comprise the polysaccharide processing enzyme. The fruit or vegetable of the 
invention is obtained from a transformed plant, the genome of which is augmented with an 
expression cassette encoding the at least one polysaccharide processing enzyme. Fruits and 
vegetables include potato, tomato, banana, squash, pea, and bean. Enzymes include oc-amylase, 
a- glucosidase, glucoamylase, pullulanase, glucose isomerase, or any combination thereof. The 
polysaccharide processing enzyme may be hyperthermophilic. 

j. Isolation of starch from transformed grain that contains a enzyme which 
disrupts the endosperm matrix 

The invention provides a method to isolate starch from a transformed grain wherein an 
enzyme is expressed that disrupts the endosperm matrix. The method involves obtaining a plant 
that expresses an enzyme which disrupts the endosperm matrix by modification of, for example, 
cell walls, non-starch polysaccharides and/or proteins. Examples of such enzymes include, but 
are not limited to, proteases, glucanases, thioredoxin, thioredoxin reductase, phytases, lipases, 
cellulases, beta glucosidases, xylanases and esterases. Such enzymes do not include any enzyme 
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that exhibits starch-degrading activity so as to maintain the integrity of the starch granules. The 
enzyme may be fused to a signal sequence that targets the enzyme to the starch granule. In one 
embodiment the grain is heat dried to activate the enzyme and inactivate the endogenous 
enzymes contained within the grain. The heat treatment causes activation of the enzyme, which 
acts to disrupt the endosperm matrix which is then easily separated from the starch granules. In 
another embodiment, the grain is steeped at low or high temperature, with high or low moisture 
content, with or without sulfur dioxide. The grain is then heat treated to disrupt the endosperm 
matrix and allow for easy separation of the starch granules. In another embodiment, proper 
temperature and moisture conditions are created to allow proteases to enter into the starch 
granules and degrade proteins contained within the granules. Such treatment would produce 
starch granules with high yield and little contaminating protein. 

k. Svrup having a high sugar equivalent and use of the syrup to produce ethanol 

or a fermented beverage 

The method involves obtaining a plant that expresses a polysaccharide processing 
enzyme which converts a polysaccharide into a sugar as described above. The plant, or product 
thereof, is steeped in an aqueous stream under conditions where the expressed enzyme converts 
polysaccharide contained within the plant, or product thereof, into dextrin, maltooligosaccharide, 
and/or sugar. The aqueous stream containing the dextrin, maltooligosaccharide, and/or sugar 
produced through conversion of the polysaccharide is then separated to produce a syrup having a 
high sugar equivalent. The method may or may not include an additional step of wet-milling the 
plant or product thereof to obtain starch granules. Examples of enzymes that may be used within 
the method include, but are not limited to, ct-amylase, glucoamylase, pullulanase and a- 
glucosidase. The enzyme may be hyperthermophilic. Sugars produced according to the method 
include, but are not limited to, hexose, glucose and fructose. Examples of plants that may be 
used with the method include, but are not limited to, corn, wheat or barley. Examples of 
products of a plant that may be used include, but are not limited to, fruit, grain and vegetables. In 
one embodiment, the polysaccharide processing enzyme is placed under the control of an 
inducible promoter. Accordingly, prior to or during the steeping process, the promoter is 
induced to cause expression of the enzyme, which then provides for the conversion of 
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polysaccharide into sugar. Examples of inducible promoters and constructs containing them are 
well known in the art and are provided herein. Thus, where the polysaccharide processing is 
hyperthermophilic, the steeping is performed at a high temperature to activate the 
hyperthermophilic enzyme and inactivate endogenous enzymes found within the plant or product 
thereof In another embodiment, a hyperthermophilic enzyme able to convert polysaccharide 
into sugar is constitutively expressed. This enzyme may or may not be targeted to a 
compartment within the plant through use of a signal sequence. The plant, or product thereof, is 
steeped under high temperature conditions to cause the conversion of polysaccharides contained 

within the plant into sugar. 

Also provided is a method to produce ethanol or a fermented beverage from syrup having 
a high sugar equivalent. The method involves incubating the syrup with yeast under conditions 
that allow conversion of sugar contained within the syrup into ethanol or a fermented beverage. 
Examples of such fermented beverages include, but are not limited to, beer and wine. 
Fermentation conditions are well known in the art and are described in U.S. Patent No.: 
4,929,452 and herein. Preferably the yeast is a high alcohol-tolerant and high-sugar tolerant strain 
of yeast such as S. cerevisiae ATCC No. 20867. The fermented product or fermented beverage 
may be distilled to isolate ethanol or a distilled beverage. 

1. Accumulation of hyperthermophilic enzyme in the cell wall of a plant 
The invention provides a method to accumulate a hyperthermophilic enzyme in the cell 
wall of a plant. The method involves expressing within a plant a hyperthermophilic enzyme that 
is fused to a cell wall targeting signal such that the targeted enzyme accumulates in the cell wall. 
Preferably the enzyme is able to convert polysaccharides into monosaccharides. Examples of 
targeting sequences include, but are not limited to, a cellulose or xylose binding domain. 
Examples of hyperthermophilic enzymes include those listed in SEQ ID NO: 1, 3, 5, 10, 13, 14, 
15 or 16. Plant material containing cell walls may be added as a source of desired enzymes in a 
process to recover sugars from the feedstock or as a source of enzymes for the conversion of 
polysaccharides originating from other sources to monosaccharides. Additionally, the cell walls 
may serve as a source from which enzymes may be purified. Methods to purify enzymes are 
well known in the art and include, but are not limited to, gel filtration, ion-exchange 
chromatography, chromatofocusing, isoelectric focusing, affinity chromatography, FPLC, 
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HPLC, salt precipitation, dialysis, and the like. Accordingly, the invention also provides purified 
enzymes isolated from the cell walls of plants. 

m. Method of preparing and isolating processing enzymes 

In accordance with the present invention, recombinantly-produced processing enzymes of 
the present invention may be prepared by transforming plant tissue or plant cell comprising the 
processing enzyme of the present invention capable of being activated in the plant, selected for 
the transformed plant tissue or cell, growing the transformed plant tissue or cell into a 
transformed plant, and isolating the processing enzyme from the transformed plant or part 
thereof. The recombinantly-produced enzyme may be an a-amylase, glucoamylase, glucose 
isomerase, ot-glucosidase, pullulinase, xylanase, protease, glucanase, beta glucosidase, esterase, 
lipase, or phytase. The enzyme may be encoded by the polynucleotide selected from any of 
SEQ ID NO: 2, 4, 6, 9, 19, 21, 25, 37, 39, 41, 43, 46, 48, 50, 52, 59, 61, 63, 65, 79, 81, 83, 85, 
87, 89, 91, 93, 94, 95, 96, 97, or 99. 

The invention will be further described by the following examples, which are not 
intended to limit the scope of the invention in any manner. 

Examples 

Example 1 

Construction of maize-optimized genes for hyperthermophilic starch- 

processing/isomerization enzymes 

The enzymes, a-amylase, pullulanase, o-glucosidase, and glucose isomerase, involved in 
starch degradation or glucose isomerization were selected for their desired activity profiles. 
These include, for example, minimal activity at ambient temperature, high temperature 
activity/stability, and activity at low pH. The corresponding genes were then designed by using 
maize preferred codons as described in U.S. Patent No. 5,625,136 and synthesized by Integrated 
DNA Technologies, Inc. (Coralville, IA). 

The 797GL3 a-amylase, having the amino acid sequence SEQ ED NO:l, was selected for 
its hyperthermophilic activity. This enzyme's nucleic acid sequence was deduced and maize- 
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optimized as represented in SEQ ID NO:2. Similarly, the 6gp3 pullulanase was selected having 
the amino acid sequence set forth in SEQ ID NO:3. The nucleic acid sequence for the 6gp3 
pullulanase was deduced and maize-optimized as represented in SEQ ID NO:4. 

The amino acid sequence for malA a-glucosidase from Sulfolobus solfataricus was 
obtained from the literature, J. Bact. 177:482-485 (1995); J. Bact. 180:1287-1295 (1998). Based 
on the published amino acid sequence of the protein (SEQ ID NO:5), the maize-optimized 
synthetic gene (SEQ ID NO:6) encoding the malA a-glucosidase was designed. 

Several glucose isomerase enzymes were selected. The amino acid sequence (SEQ ID 
NO: 18) for glucose isomerase derived from Thermotoga maritima was predicted based on the 
published DNA sequence having Accession No. NC_000853 and a maize-optimized synthetic 
gene was designed (SEQ ED NO: 19). Similarly the amino acid sequence (SEQ ID NO:20) for 
glucose isomerase derived from Thermotoga neapolitana was predicted based on the published 
DNA sequence from Appl. Envir. Microbiol. 61(5):1867-1875 (1995), Accession No. L38994. 
A maize-optimized synthetic gene encoding the Thermotoga neapolitana glucose isomerase was 
designed (SEQ ID NO:21). 

Example 2 

Expression of fusion of 797GL3 a-amvlase and starch encapsulating region in K coli 
A construct encoding hyperthermophilic 797GL3 ct-amylase fused to the starch 
encapsulating region (SER) from maize granule-bound starch synthase (waxy) was introduced 
and expressed in E. coli. The maize granule-bound starch synthase cDNA (SEQ ED NO:7) 
encoding the amino acid sequence (SEQ ID NO:8)(Klosgen RB, et al. 1986) was cloned as a 
source of a starch binding domain, or starch encapsulating region (SER). The full-length cDNA 
was amplified by RT-PCR from RNA prepared from maize seed using primers SV57 
(5 ' AGCG AATTC ATGGCGGCTCTGGCC ACGT 3') (SEQ ID NO: 22) and SV58 
( 5 ' AGCT AAGCTTC AGGGCGCGGCC ACGTTCT 3') (SEQ ID NO: 23) designed from 
GenBank Accession No. X03935. The complete cDNA was cloned into pBluescript as an 
EcoRI/Hindlll fragment and the plasmid designated pNOV4022. 
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The C-terminal portion (encoded by bp 919-1818) of the waxy cDNA, including the 
starch-binding domain, was amplified from pNOV4022 and fused in-frame to the 3' end of the 
full-length maize-optimized 797GL3 gene (SEQ ID NO:2). The fused gene product, 
797GL3AVaxy, having the nucleic acid SEQ ED NO:9 and encoding the amino acid sequence, 
SEQ ID NO: 10, was cloned as an Ncol/Xbal fragment into pET28b (NOVAGEN, Madison, WI) 
that was cut with Ncol/Nhel. The 797GL3 gene alone was also cloned into the pET28b vector as 

an Ncol/Xbal fragment. 

The pET28/797GL3 and the pET28/797GL3AVaxy vectors were transformed into 
BL21/DE3 E. coli cells (NOVAGEN) and grown and induced according to the manufacturer's 
instruction. Analysis by PAGE/Coomassie staining revealed an induced protein in both extracts 
corresponding to the predicted sizes of the fused and unfused amylase, respectively. 

Total cell extracts were analyzed for hyperthermophilic amylase activity as follows: 5 
mg of starch was suspended in 20 \x\ of water then diluted with 25 |nl of ethanol. The standard 
amylase positive control or the sample to be tested for amylase activity was added to the mixture 
and water was added to a final reaction volume of 500 |al. The reaction was carried out at 80°C 
for 15-45 minutes. The reaction was then cooled down to room temperature, and 500 |al of o- 
dianisidine and glucose oxidase/peroxidase mixture (Sigma) was added. The mixture was 
incubated at 37°C for 30 minutes. 500 \x\ of 12 N sulfuric acid was added to stop the reaction. 
Absorbance at 540 nm was measured to quantitate the amount of glucose released by the 
amylase/sample. Assay of both the fused and unfused amylase extracts gave similar levels of 
hyperthermophilic amylase activity, whereas control extracts were negative. This indicated that 
the 797GL3 amylase was still active (at high temperatures) when fused to the C-terminal portion 
of the waxy protein. 

Example 3 

Isolation of promoter fragments for endosperm-specific expressio n in maize. 
The promoter and 5' noncoding region I (including the first intron) from the large subunit 
of Zea mays ADP-gpp (ADP-glucose pyrophosphorylase) was amplified as a 1515 base pair 
fragment (SEQ ID NO:l 1) from maize genomic DNA using primers designed from Genbank 
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accession M8 1603. The ADP-gpp promoter has been shown to be endosperm-specific (Shaw 
and Hannah, 1992). 

The promoter from the Zea mays y-zein gene was amplified as a 673 bp fragment (SEQ 
ED NO: 12) from plasmid pGZ27.3 (obtained from Dr. Brian Larkins). The y-zein promoter has 
been shown to be endosperm-specific (Torrent et al. 1997). 

Example 4 

Construction of transformation vectors for the 797GL3 hvperthermophilic q-amvlase 

Expression cassettes were constructed to express the 797GL3 hyperthermophilic amylase 
in maize endosperm with various targeting signals as follows: 

pNOV6200 (SEQ ID NO: 13) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic 797GL3 amylase as 
described above in Example 1 for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1997). The fusion was cloned behind the maize ADP-gpp promoter for 
expression specifically in the endosperm. 

pNOV6201 (SEQ ED NO: 14) comprises the y-zein N-terminal signal sequence fused to 
the synthetic 797GL3 amylase with a C-terminal addition of the sequence SEKDEL for targeting 
to and retention in the endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was 
cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 

pNOV7013 comprises the y-zein N-terminal signal sequence fused to the synthetic 
797GL3 amylase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the endoplasmic reticulum (ER). PNOV7013 is the same as pNOV6201, except that 
the the maize y- zein promoter (SEQ ID NO: 12) was used instead of the maize ADP-spp 
promoter in order to express the fusion in the endosperm. 

pNOV4029 (SEQ ED NO: 15) comprises the waxy amyloplast targeting peptide (Klosgen 
et al., 1986) fused to the synthetic 797GL3 amylase for targeting to the amyloplast. The fusion 
was cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 
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pNOV4031 (SEQ ID NO: 16) comprises the waxy amyloplast targeting peptide fused to 
the synthetic 797GL3/waxy fusion protein for targeting to starch granules. The fusion was 
cloned behind the maize ADP-gpp promoter for expression specifically in the endosperm. 

Additional constructs were made with these fusions cloned behind the maize y-zein 
promoter to obtain higher levels of enzyme expression. All expression cassettes were moved 
into a binary vector for transformation into maize via Agrobacterium infection. The binary 
vector contained the phosphomannose isomerase (PMI) gene which allows for selection of 
transgenic cells with mannose. Transformed maize plants were either self-pollinated or 
outcrossed and seed was collected for analysis. 

Additional constructs were made with the targeting signals described above fused to 
either 6gp3 pullulanase or to 340gl2 ot-glucosidase in precisely the same manner as described 
for the a-amylase. These fusions were cloned behind the maize ADP-gpp promoter and/or the y- 
zein promoter and transformed into maize as described above. Transformed maize plants were 
either self-pollinated or outcrossed and seed was collected for analysis. 

Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable cotransformation. 

Example 5 

Construction of plant transformation vectors for the 6GP3 thermophilic pullulanase 
An expression cassette was constructed to express the 6GP3 thermophillic pullanase in 
the endoplasmic reticulum of maize endosperm as follows: 

pNOV7005 (SEQ ID NOs:24 and 25) comprises the maize y-zein N-terminal signal 
sequence fused to the synthetic 6GP3 pullulanase with a C-terminal addition of the sequence 
SEKDEL for targeting to and retention in the ER. The amino acid peptide SEKDEL was fused 
to the C-terminal end of the enzymes using PCR with primers designed to amplify the synthetic 
gene and simultaneously add the 6 amino acids at the C-terminal end of the protein. The fusion 
was cloned behind the maize y-zein promoter for expresson specifically in the endosperm. 

Example 6 

Construction of plant transformation vectors for the malA 
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hvperthermophilic a-glucosidase 

Expression cassettes were constructed to express the Sulfolobus solfataricus malA 
hyperthermophilic cx-glucosidase in maize endosperm with various targeting signals as follows: 

pNOV4831 (SEQ ID NO:26) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic malA a-glucosidase 
with a C-terminal addition of the sequence SEKDEL for targeting to and retention in the 
endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the 
maize y-zein promoter for expresson specifically in the endosperm. 

pNOV4839 (SEQ ED NO:27) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ED NO: 17) fused to the synthetic malA a-glucosidase for 
targeting to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The 
fusion was cloned behind the maize y-zein promoter for expression specifically in the 
endosperm. 

pNOV4837 comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic malA a-glucosidase 
with a C-terminal addition of the sequence SEKDEL for targeting to and retention in the ER. 
The fusion was cloned behind the maize ADPgpp promoter for expression specifically in the 
endosperm. The amino acid sequence for this clone is identical to that of pNOV4831 (SEQ ID 
NO:26). 
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Example 7 

Construction of plant transformation vectors for the hyperthermophillic 
Thermotoga maritima and Thermotoga neapolitana glucose isomerases 

Expression cassettes were constructed to express the Thermotoga maritima and 
Thermotoga neapolitana hyperthermophilic glucose isomerases in maize endosperm with 
various targeting signals as follows: 

pNOV4832 (SEQ ID NO:28) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga maritima 
glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the ER. The fusion was cloned behind the maize y-zein promoter for expression 
specifically in the endosperm. 

pNOV4833 (SEQ ED NO:29) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga 
neapolitana glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting 
to and retention in the ER. The fusion was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4840 (SEQ ED NO:30) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ED NO: 17) fused to the synthetic Thermotoga 
neapolitana glucose isomerase for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1997). The fusion was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4838 comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic Thermotoga neapolitana 
glucose isomerase with a C-terminal addition of the sequence SEKDEL for targeting to and 
retention in the ER. The fusion was cloned behind the maize ADPgpp promoter for expression 
specifically in the endosperm. The amino acid sequence for this clone is identical to that of 
pNOV4833 (SEQ ED NO:29). 
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Example 8 

Construction of plant transformation vectors for the 
expression of the hvperthermophillic glucanase EglA 
pNOV4800 (SEQ ID NO:58) comprises the barley alpha amylase AMY32b signal 
sequence (MGKNGNLCCFSLLLLLLAGLASGHQ)(SEQ ED NO:31) fused with the EglA 
mature protein sequence for localization to the apoplast. The fusion was cloned behind the 
maize y-zein promoter for expression specifically in the endosperm. 

Example 9 

Construction of plant transformation vectors for the 
expression of multiple hvperthermophillic enzymes 

pNOV4841 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
6GP3 pullulanase fusion. Both 797GL3 ftision (SEQ ID NO:33) and 6GP3 fusion (SEQ ID 
NO:34) possessed the maize y-zein N-terminal signal sequence and SEKDEL sequence for 
targeting to and retention in the ER. Each fusion was cloned behind a separate maize y-zein 
promoter for expression specifically in the endosperm. 

pNOV4842 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
malA ot-glucosidase fusion. Both the 797GL3 fusion polypeptide (SEQ ID NO:35) and malA a- 
glucosidase fusion polypeptide (SEQ ID NO:36) possess the maize y-zein N-terminal signal 
sequence and SEKDEL sequence for targeting to and retention in the ER. Each fusion was 
cloned behind a separate maize y-zein promoter for expression specifically in the endosperm. 

pNOV4843 comprises a double gene construct of a 797GL3 a-amylase fusion and a 
malA a-glucosidase fusion. Both the 797GL3 fusion and malA a-glucosidase fusion possess the 
maize y-zein N-terminal signal sequence and SEKDEL sequence for targeting to and retention in 
the ER. The 797GL3 fusion was cloned behind the maize y-zein promoter and the malA fusion 
was cloned behind the maize ADPgpp promoter for expression specifically in the endosperm. 
The amino acid sequences of the 797GL3 fusion and the malA fusion are identical to those of 
pNOV4842 (SEQ ID Nos: 35 and 36, respectively). 

pNOV4844 comprises a triple gene construct of a 797GL3 a-amylase fusion, a 6GP3 
pullulanase fusion, and a malA a-glucosidase fusion. 797GL3, malA, and 6GP3 all possess the 
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maize y-zein N-terminal signal sequence and SEKDEL sequence for targeting to and retention in 
the ER. The 797GL3 and malA fusions were cloned behind 2 separate maize y-zein promoters, 
and the 6GP3 fusion was cloned behind the maize ADPgpp promoter for expression specifically 
in the endosperm. The amino acid sequences for the 797GL3 and malA fusions are identical to 
those of pNOV4842 (SEQ ID Nos: 35 and 36, respectively). The amino acid sequence for the 
6GP3 fusion is identical to that of the 6GP3 fusion in pNOV4841 (SEQ ID NO:34). 

All expression cassettes set forth in this Example as well as in the Examples that follow 
were moved into the binary vector pNOV21 17 for transformation into maize via Agrobacterium 
infection. pNOV21 17 contains the phosphomannose isomerase (PMI) gene allowing for 
selection of transgenic cells with mannose. pNOV21 17 is a binary vector with both the pVSl 
and ColEl origins of replication. This vector contains the constitutive VirG gene from pAD1289 
(Hansen, G., et al., PNAS USA 91 :7603-7607 (1994), incorporated by reference herein) and a 
spectinomycin resistance gene from Tn7. Cloned into the polylinker between the right and left 
borders are the maize ubiquitin promoter, PMI coding region and nopaline synthase terminator of 
pNOVl 17 (Negrotto, D., et al, Plant Cell Reports 19:798-803 (2000), incorporated by reference 
herein). Transformed maize plants will either be self-pollinated or outcrossed and seed collected 
for analysis. Combinations of the different enzymes can be produced either by crossing plants 
expressing the individual enzymes or by transforming a plant with one of the multi-gene 
cassettes. 

Example 10 

Construction of bacterial and Pichia expression vectors 
Expression cassettes were constructed to express the hyperthermophilic a-glucosidase 

and glucose isomerases in either Pichia or bacteria as follows: 

pNOV4829 (SEQ ED NOS: 37 and 38) comprises a synthetic Thermotoga maritima 

glucose isomerase fusion with ER retention signal in the bacterial expression vector pET29a. 

The glucose isomerase fusion gene was cloned into the Ncol and SacI sites of pET29a, which 

results in the addition of an N-terminal S-tag for protein purification. 

pNOV4830 (SEQ ED NOS: 39 and 40) comprises a synthetic TJiermotoga neapolitana 

glucose isomerase fusion with ER retention signal in the bacterial expression vector pET29a. 
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The glucose isomerase fusion gene was cloned into the Ncol and Sad sites of pET29a, which 
results in the addition of an N-terminal S-tag for protein purification. 

pNOV4835 (SEQ ED NO: 41 and 42) comprises the synthetic Thermotoga maritima 
glucose isomerase gene cloned into the BamHI and EcoRI sites of the bacterial expression vector 
pET28C. This resulted in the fusion of a His-tag (for protein purification) to the N-terminal end 

of the glucose isomerase. 

pNOV4836 (SEQ ID NO: 43 AND 44) comprises the synthetic Thermotoga neapolitana 
glucose isomerase gene cloned into the BamHI and EcoRI sites of the bacterial expression vector 
pET28C. This resulted in the fusion of a His-tag (for protein purification) to the N-terminal end 
of the glucose isomerase. 

Example 11 

Transformation of immature maize embryos was performed essentially as described in 
Negrotto et al., Plant Cell Reports 19: 798-803. For this example, all media constituents are as 
described in Negrotto et al., supra. However, various media constituents described in the 

literature may be substituted. 

A. Transformation plasmids and selectable marker 

The genes used for transformation were cloned into a vector suitable for maize 
transformation. Vectors used in this example contained the phosphomannose isomerase (PMI) 
gene for selection of transgenic lines (Negrotto et al. (2000) Plant Cell Reports 19: 798-803). 
B. Preparation of Agrobacterium tumefaciens 

Agrobacterium strain LBA4404 (pSBl) containing the plant transformation plasmid was 
grown on YEP (yeast extract (5 g/L), peptone (lOg/L), NaCl (5g/L),15g/l agar, pH 6.8) solid 
medium for 2 - 4 days at 28°C. Approximately 0.8X 10 9 Agrobacterium were suspended in LS- 
inf media supplemented with 100 As (Negrotto et al, (2000) Plant Cell Rep 19: 798-803). 
Bacteria were pre-induced in this medium for 30-60 minutes. 
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C. Inoculation 

Immature embryos from A188 or other suitable genotype were excised from 8-12 day old ears 
into liquid LS-inf + 100 |liM As. Embryos were rinsed once with fresh infection medium. 
Agrobacterium solution was then added and embryos were vortexed for 30 seconds and allowed 
to settle with the bacteria for 5 minutes. The embryos were then transferred scutellum side up to 
LSAs medium and cultured in the dark for two to three days. Subsequently, between 20 and 25 
embryos per petri plate were transferred to LSDc medium supplemented with cefotaxime (250 
mg/1) and silver nitrate (1 .6 mg/1) and cultured in the dark for 28°C for 10 days. 

D. Selection of transformed cells and regeneration of transformed plants 
Immature embryos producing embryogenic callus were transferred to LSD1M0.5S medium. The 
cultures were selected on this medium for 6 weeks with a subculture step at 3 weeks. Surviving 
calli were transferred to Regl medium supplemented with mannose. Following culturing in the 
light (16 hour light/ 8 hour dark regiment), green tissues were then transferred to Reg2 medium 
without growth regulators and incubated for 1-2 weeks. Plantlets are transferred to Magenta 
GA-7 boxes (Magenta Corp, Chicago 111.) containing Reg3 medium and grown in the light. 
After 2-3 weeks, plants were tested for the presence of the PMI genes and other genes of interest 
by PCR. Positive plants from the PCR assay were transferred to the greenhouse. 

Example 12 

Analysis of Tl seed from maize plants expressing the 
q-amvlase targeted to apoplast or to the ER 
Tl seed from self-pollinated maize plants transformed with either pNOV6200 or 
pNOV6201 as described in Example 4 were obtained. Starch accumulation in these kernels 
appeared to be normal, based on visual inspection and on normal staining for starch with an 
iodine solution prior to any exposure to high temperature. Immature kernels were dissected and 
purified endosperms were placed individually in micro fuge tubes and immersed in 200 |il of 
50 mM NaP0 4 buffer. The tubes were placed in an 85°C water bath for 20 minutes, then cooled 
on ice. Twenty microliters of a 1% iodine solution was added to each tube and mixed. 
Approximately 25% of the segregating kernels stained normally for starch. The remaining 75% 
failed to stain, indicating that the starch had been degraded into low molecular weight sugars that 
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do not stain with iodine. It was found that the Tl kernels of pNOV6200 and pNOV6201 were 
self-hydrolyzing the corn starch. There was no detectable reduction in starch following 
incubation at 37°C. 

Expression of the amylase was further analyzed by isolation of the hyperthermophilic 
protein fraction from the endosperm followed by PAGE/Coomassie staining. A segregating 
protein band of the appropriate molecular weight (50 kD) was observed. These samples are 
subjected to an ot-amylase assay using commercially available dyed amylose (AMYLAZYME, 
from Megazyme, Ireland). High levels of hyperthermophilic amylase activity correlated with the 
presence of the 50 kD protein. 

It was further found that starch in kernels from a majority of transgenic maize, which 
express hyperthermophilic oamylase, targeted to the amyloplast, is sufficiently active at ambient 
temperature to hydrolyze most of the starch if the enzyme is allowed to be in direct contact with 
a starch granule. Of the eighty lines having hyperthermophilic a -amylase targeted to the 
amyloplast, four lines were identified that accumulate starch in the kernels. Three of these lines 
were analyzed for thermostable a -amylase activity using a colorimetric amylazyme assay 
(Megazyme). The amylase enzyme assay indicated that these three lines had low levels of 
thermostable amylase activity. When purified starch from these three lines was treated with 
appropriate conditions of moisture and heat, the starch was hydrolyzed indicating the presence of 
adequate levels of a -amylase to facilitate the auto-hydrolysis of the starch prepared from these 

lines. 

Tl seed from multiple independent lines of both pNOV6200 and pNOV6201 
transformants was obtained. Individual kernels from each line were dissected and purified 
endosperms were homogenized individually in 300 |il of 50 mM NaP0 4 buffer. Aliquots of the 
endosperm suspensions were analyzed for ot-amylase activity at 85°C. Approximately 80% of 
the lines segregate for hyperthermophilic activity (See Figures 1 A, IB, and 2). 

Kernels from wild type plants or plants transformed with pNOV6201 were heated at 
100°C for 1, 2, 3, or 6 hours and then stained for starch with an iodine solution. Little or no 
starch was detected in mature kernels after 3 or 6 hours, respectively. Thus, starch in mature 
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kernels from transgenic maize which express hyperthermophilic amylase that is targeted to the 
endoplasmic reticulum was hydrolyzed when incubated at high temperature. 

In another experiment, partially purified starch from mature Tl kernels from pNOV6201 
plants that were steeped at 50°C for 16 hours was hydrolyzed after heating at 85°C for 5 minutes. 
This illustrated that the a-amylase targeted to the endoplasmic reticulum binds to starch after 
grinding of the kernel, and is able to hydrolyze the starch upon heating. Iodine staining indicated 
that the starch remains intact in mature seeds after the 16 hour steep at 50°C. 

In another experiment, segregating, mature kernels from plants transformed with 
pNOV6201 were heated at 95°C for 16 hours and then dried. In seeds expressing the 
hyperthermophilic a-amylase, the hydrolysis of starch to sugar resulted in a wrinkled appearance 
following drying. 

Example 13 

Analysis of Tl seed from maize plants expressing the 
q-amvlase targeted to the amyloplast 
Tl seed from self-pollinated maize plants transformed with either pNOV4029 or 
pNOV4031 as described in Example 4 was obtained. Starch accumulation in kernels from these 
lines was clearly not normal. All lines segregated, with some variation in severity, for a very low 
or no starch phenotype. Endosperm purified from immature kernels stained only weakly with 
iodine prior to exposure to high temperatures. After 20 minutes at 85°C, there was no staining. 
When the ears were dried, the kernels shriveled up. This particular amylase clearly had sufficient 
activity at greenhouse temperatures to hydrolyze starch if allowed to be in direct contact with the 
granule 

Example 14 

Fermentation of grain from maize plants expressing q-amvlase 
100% Transgenic grain 85°C vs. 95°C, varied liquefaction time. 
Transgenic corn (pNOV6201) that contains a thermostable a-amylase performs well in 
fermentation without addition of exogenous a-amylase, requires much less time for liquefaction 
and results in more complete solubilization of starch. Laboratory scale fermentations were 
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performed by a protocol with the following steps (detailed below): 1) grinding, 2) moisture 
analysis, 3) preparation of a slurry containing ground corn, water, backset and a-amylase, 4) 
liquefaction and 5) simultaneous saccharification and fermentation (SSF). In this example the 
temperature and time of the liquefaction step were varied as described below. In addition the 
transgenic corn was liquefied with and without exogenous a-amylase and the performance in 
ethanol production compared to control corn treated with commercially available a-amylase. 

The transgenic corn used in this example was made in accordance with the procedures set 
out in Example 4 using a vector comprising the a-amylase gene and the PMI selectable marker, 
namely pNOV6201. The transgenic corn was produced by pollinating a commercial hybrid 
(N3030BT) with pollen from a transgenic line expressing a high level of thermostable 
a-amylase. The corn was dried to 1 1% moisture and stored at room temperature. The a- 
amylase content of the transgenic corn flour was 95 units/g where 1 unit of enzyme generates 1 
micromole reducing ends per min from corn flour at 85 °C in pH 6.0 MES buffer. The control 
corn that was used was a yellow dent corn known to perform well in ethanol production. 

1) Grinding: Transgenic corn (1 1 80 g) was ground in a Perten 3 100 hammer mill 
equipped with a 2.0 mm screen thus generating transgenic com flour. Control corn was ground 
in the same mill after thoroughly cleaning to prevent contamination by the transgenic corn. 

2) Moisture analysis: Samples (20 g) of transgenic and control corn were weighed into 
aluminum weigh boats and heated at 100 C for 4 h. The samples were weighed again and the 
moisture content calculated from the weight loss. The moisture content of transgenic flour was 
9.26%; that of the control flour was 12.54%. 

3) Preparation of slurries: The composition of slurries was designed to yield a mash with 
36% solids at the beginning of SSF. Control samples were prepared in 100 ml plastic bottles and 
contained 21.50 g of control corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight), and 0.30 ml of a commercially available a-amylase diluted 1/50 with water. The a- 
amylase dose was chosen as representative of industrial usage. When assayed under the 
conditions described above for assay of the transgenic a-amylase, the control a-amylase dose 
was 2 U/g corn flour. pH was adjusted to 6.0 by addition of ammonium hydroxide. Transgenic 
samples were prepared in the same fashion but contained 20 g of corn flour because of the lower 
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moisture content of transgenic flour. Slurries of transgenic flour were prepared either with a- 
amylase at the same dose as the control samples or without exogenous a-amylase. 

4) Liquefaction: The bottles containing slurries of transgenic corn flour were immersed 
in water baths at either 85 °C or 95 °C for times of 5, 15, 30, 45 or 60 min. Control slurries were 
incubated for 60 min at 85 °C. During the high temperature incubation the slurries were mixed 
vigorously by hand every 5 min. After the high temperature step the slurries were cooled on ice. 

5) Simultaneous saccharification and fermentation: The mash produced by liquefaction 
was mixed with glucoamylase (0.65 ml of a 1/50 dilution of a commercially available L-400 
glucoamylase), protease (0.60 ml of a 1,000- fold dilution of a commercially available protease), 
0.2 mg Lactocide & urea (0.85 ml of a 10- fold dilution of 50% Urea Liquor). A hole was cut 
into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The mash was then 
inoculated with yeast ( 1 .44 ml) and incubated in a water bath set at 90 F. After 24 hours of 
fermentation the temperature was lowered to 86 F; at 48 hours it was set to 82 F. 

Yeast for inoculation was propagated by preparing a mixture that contained yeast (0.12 g) 
with 70 grams maltodextrin, 230 ml water, 100 ml backset, glucoamylase (0.88 ml of a 10-fold 
dilution of a commercially available glucoamylase), protease (1.76 ml of a 100-fold dilution of a 
commercially available enzyme), urea (1.07 grams), penicillin (0.67 mg) and zinc sulfate (0.13 
g). The propagation culture was initiated the day before it was needed and was incubated with 
mixing at 90°F. 

At 24, 48 & 72 hour samples were taken from each fermentation vessel, filtered through 
0.2 ^m filters and analyzed by HPLC for ethanol & sugars. At 72 h samples were analyzed for 
total dissolved solids and for residual starch. 

HPLC analysis was performed on a binary gradient system equipped with refractive 
index detector, column heater & Bio-Rad Aminex HPX-87H column. The system was 
equilibrated with 0.005 M H 2 S0 4 in water at 1 ml/min. Column temperature was 50 °C. Sample 
injection volume was 5 |xl; elution was in the same solvent. The RI response was calibrated by 
injection of known standards. Ethanol and glucose were both measured in each injection. 

Residual starch was measured as follows. Samples and standards were dried at 50°C in 
an oven, then ground to a powder in a sample mill. The powder (0.2 g) was weighed into a 15 
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ml graduated centrifuge tube. The powder was washed 3 times with 10 ml aqueous ethanol 
(80% v/v) by vortexing followed by centrifugation and discarding of the supernatant. DMSO 
(2.0 ml) was added to the pellet followed by 3.0 ml of a thermostable alpha-amylase (300 units) 
in MOPS buffer. After vigorous mixing, the tubes were incubated in a water bath at 85°C for 60 
min. During the incubation, the tubes were mixed four times. The samples were cooled and 4.0 
ml sodium acetate buffer (200 mM, pH 4.5) was added followed by 0.1 ml of glucoamylase (20 
U). Samples were incubated at 50°C for 2 hours, mixed, then centrifuged for 5 min at 3,500 
rpm. The supernatant was filtered through a 0.2 urn filter and analyzed for glucose by the HPLC 
method described above. An injection size of 50 |il was used for samples with low residual 
starch (<20% of solids). 

Results Transgenic corn performed well in fermentation without added a-amylase. The 
yield of ethanol at 72 hours was essentially the same with or without exogenous a-amylase as 
shown in Table I. These data also show that a higher yield of ethanol is achieved when the 
liquefaction temperature is higher; the present enzyme expressed in the transgenic corn has 
activity at higher temperatures than other enzymes used commercially such as the Bacillus 
liquefaciens a-amylase. 
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Table I 



Liquefaction 


JLiqueiaction 


Exogenous a- 




IVl VCLU 


Ctrl Dev 

IU. L/C V * 


temp 


lime 


amylase 




Ethanol % 


% v/v 

^ u v/v 


op 


min. 

a unit 






v/v 




85 


60 


Yes 


4 


17.53 


0.18 


85 


60 


No 


4 


17.78 


0.27 


95 


60 


Yes 


2 


18.22 


ND 


95 


60 


No 


2 


18.25 


ND 



When the liquefaction time was varied, it was found that the liquefaction time required for 
efficient ethanol production was much less than the hour required by the conventional process. 
Figure 3 shows that the ethanol yield at 72 hours fermentation was almost unchanged from 15 
min to 60 min liquefaction. In addition liquefaction at 95°C gave more ethanol at each time 
point than at the 85°C liquefaction. This observation demonstrates the process improvement 
achieved by use of a hyperthermophilic enzyme. 

The control corn gave a higher final ethanol yield than the transgenic corn, but the control 
was chosen because it performs very well in fermentation. In contrast the transgenic corn has a 
genetic background chosen to facilitate transformation. Introducing the a-amylase-trait into elite 
corn germplasm by well-known breeding techniques should eliminate this difference. 

Examination of the residual starch levels of the beer produced at 72 hours (Figure 4) 
shows that the transgenic a-amylase results in significant improvement in making starch 
available for fermentation; much less starch was left over after fermentation. 

Using both ethanol levels and residual starch levels the optimal liquefaction times were 
1 5 min at 95°C and 30 min at 85°C. In the present experiments these times were the total time 
that the fermentation vessels were in the water bath and thus include a time period during which 
the temperature of the samples was increasing from room temperature to 85°C or 95°C. Shorter 
liquefaction times may be optimal in large scale industrial processes that rapidly heat the mash 
by use of equipment such as jet cookers. Conventional industrial liquefaction processes require 
holding tanks to allow the mash to be incubated at high temperature for one or more hours. The 

90 



WO 2005/096804 



PCT/US2004/007182 



present invention eliminates the need for such holding tanks and will increase the productivity of 
liquefaction equipment. 

One important function of a-amylase in fermentation processes is to reduce the viscosity 
of the mash. At all time points the samples containing transgenic corn flour were markedly less 
viscous than the control sample. In addition the transgenic samples did not appear to go through 
the gelatinous phase observed with all control samples; gelatinization normally occurs when 
corn slurries are cooked. Thus having the ot-amylase distributed throughout the fragments of the 
endosperm gives advantageous physical properties to the mash during cooking by preventing 
formation of large gels that slow diffusion and increase the energy costs of mixing and pumping 
the mash. 

The high dose of ot-amylase in the transgenic corn may also contribute to the favorable 
properties of the transgenic mash. At 85°C, the ot-amylase activity of the transgenic corn was 
many times greater activity than the of the dose of exogenous ot-amylase used in controls. The 
latter was chosen as representative of commercial use rates. 

Example 15 

Effective function of transgenic corn when mixed with control corn 
Transgenic corn flour was mixed with control corn flour in various levels from 5% to 
100% transgenic corn flour. These were treated as described in Example 14. The mashes 
containing transgenically expressed a-amylase were liquefied at 85 °C for 30 min or at 95 °C for 
15 min; control mashes were prepared as described in Example 14 and were liquefied at 85 °C 
for 30 or 60 min (one each) or at 95 °C for 15 or 60 min (one each). 

The data for ethanol at 48 and 72 hours and for residual starch are given in Table 2. The 
ethanol levels at 48 hours are graphed in Figure 5; the residual starch determinations are shown 
in Figure 6. These data show that transgenically expressed thermostable a-amylase gives very 
good performance in ethanol production even when the transgenic grain is only a small portion 
(as low as 5%) of the total grain in the mash. The data also show that residual starch is markedly 
lower than in control mash when the transgenic grain comprises at least 40% of the total grain. 
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Table 2 





85 °C Liquefaction 


95 °C Liquefaction 


Transgenic 

• 

grain 

_A.fi/ 

wt % 


Residual 
Starch 


Ethanol 
48 h 


Ethanol 

% v/v 
72 h 


Residual 
Starch 


Ethanol 
48 h 


Ethanol 

% v/v 
72 h 


100 


3.58 


16.71 


18.32 


4.19 


17.72 


21.14 


80 


4.06 


17.04 


19.2 


3.15 


17.42 


19.45 


60 


3.86 


17.16 


19.67 


4.81 


1 H CO 

17.58 


1 n c*7 


/in 


5.14 


17.28 


19.83 


8.69 


17.56 


19.51 


20 


8.77 


17.11 


19.5 


ll. 05 


17.71 


19.36 


10 


10.03 


18.05 


19.76 


10.8 


17.83 


19.28 


5 


10.67 


18.08 


19.41 


12.44 


17.61 


19.38 


0* 


7.79 


17.64 


20.11 


11.23 


17.88 


19.87 



* Control samples . Values the average of 2 determinations 



Example 16 

Ethanol production as a function of liquefaction pH using 
transgenic corn at a rate of 1.5 to 12 % of total corn 
Because the transgenic corn performed well at a level of 5-10% of total corn in a 
fermentation, an additional series of fermentations in which the transgenic corn comprised 1.5 to 
12% of the total corn was performed. The pH was varied from 6.4 to 5.2 and the a-amylase 
enzyme expressed in the transgenic corn was optimized for activity at lower pH than is 
conventionally used industrially. 

The experiments were performed as described in Example 15 with the following 

exceptions: 

1) . Transgenic flour was mixed with control flour as a percent of total dry weight at the levels 
ranging from 1.5% to 12.0%. 

2) . Control corn was N3O30BT which is more similar to the transgenic corn than the control 
used in examples 14 and 15. 

3) . No exogenous a-amylase was added to samples containing transgenic flour. 
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4) . Samples were adjusted to pH 5.2, 5.6, 6.0 or 6.4 prior to liquefaction. At least 5 samples 
spanning the range from 0% transgenic corn flour to 1 2% transgenic corn flour were prepared for 
each pH. 

5) . Liquefaction for all samples was performed at 85 °C for 60 min. 

The change in ethanol content as a function of fermentation time are shown in Figure 7. 
This figure shows the data obtained from samples that contained 3% transgenic corn. At the 
lower pH, the fermentation proceeds more quickly than at pH 6.0 and above; similar behavior 
was observed in samples with other doses of transgenic grain. The pH profile of activity of the 
transgenic enzyme combined with the high levels of expression will allow lower pH 
liquefactions resulting in more rapid fermentations and thus higher throughput than is possible at 
the conventional pH 6.0 process. 

The ethanol yields at 72 hours are shown in Figure 8. As can be seen, on the basis of 
ethanol yield, the results showed little dependence on the amount of transgenic grain included in 
the sample. Thus the grain contains abundant amylase to facilitate fermentative production of 
ethanol. It is also demonstrates that lower pH of liquefaction results in higher ethanol yield. 

The viscosity of the samples after liquefaction was monitored and it was observed that at 
pH 6.0, 6% transgenic grain is sufficient for adequate reduction in viscosity. At pH 5.2 and 5.6, 
viscosity is equivalent to that of the control at 12% transgenic grain, but not at lower percentages 
of transgenic grain. 

Example 17 

Production of fructose from corn flour using therm ophilic enzymes 

Corn that expresses the hyperthermophilic ot-amylase, 797GL3, was shown to facilitate 
production of fructose when mixed with an ot-glucosidase (MalA) and a xylose isomerase 
(XylA). 

Seed from pNOV6201 transgenic plants expressing 797GL3 were ground to a flour in a 
Kleco cell thus creating amylase flour. Non-transgenic corn kernels were ground in the same 
manner to generate control flour. 

The ct-glucosidase, MalA (from 5. solfataricus), was expressed in E. coli. Harvested 
bacteria were suspended in 50 mM potassium phosphate buffer pH 7.0 containing 1 mM 4-(2- 
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aminoethyl)benzenesulfonyl fluoride then lysed in a French pressure cell. The lysate was 
centrifiiged at 23,000 x g for 15 min at 4°C. The supernatant solution was removed, heated to 
70° C for 10 min, cooled on ice for 10 min, then centrifuged at 34,000 x g for 30 min at 4°C. The 
supernatant solution was removed and the Mai A concentrated two-fold in centricon 10 devices. 
The filtrate of the centricon 10 step was retained for use as a negative control for Mai A. 

Xylose (glucose) isomerase was prepared by expressing the xylA gene of T. neapolitana 
in E. coli. Bacteria were suspended in 100 mM sodium phosphate pH 7.0 and lysed by passage 
through a French pressure cell. After precipitation of cell debris, the extract was heated at 80° C 
for 10 min then centrifuged. The supernatant solution contained the XylA enzymatic activity. 
An empty-vector control extract was prepared in parallel with the XylA extract. 

Corn flour (60 mg per sample) was mixed with buffer and extracts from E coli. As 
indicated in Table 3, samples contained amylase corn flour (amylase) or control corn flour 
(control), 50 \x\ of either Mai A extract (+) or filtrate (-), and 20 \x\ of either XylA extract (+) or 
empty vector control (-). All samples also contained 230 |li1 of 50mM MOPS, lOmM MgS04, 
and l mM CoC12; pH of the buffer was 7.0 at room temperature. 

Samples were incubated at 85°C for 18 hours. At the end of the incubation time, samples 
were diluted with 0.9 ml of 85°C water and centrifuged to remove insoluble material. The 
supernatant fraction was then filtered through a Centricon3 ultrafiltration device and analyzed by 
HPLC with ELSD detection. 

The gradient HPLC system was equipped with Astec Polymer Amino Column, 5 micron 
particle size, 250 X 4.6 mm and an Alltech ELSD 2000 detector. The system was pre- 
equilibrated with a 15:85 mixture of watenacetonitrile. The flow rate was 1 ml/min. The initial 
conditions were maintained for 5 min after injection followed by a 20 min gradient to 50:50 
wateracetonitrile followed by 10 minutes of the same solvent. The system was washed with 20 
min of 80:20 watenacetonitrile and then re-equilibrated with the starting solvent. Fructose was 
eluted at 5.8 min and glucose at 8.7 min. 
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Table 3 
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4 
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0 


25.9 


5 


control 


+ 


+ 


0.8 


0.5 


6 


control 




+ 


0.3 


0.2 


7 


control 


+ 




1.3 


1.7 


8 


control 






0.2 


0.3 



The HPLC results also indicated the presence of larger maltooligosaccharides in all 
samples containing the a-amylase. These results demonstrate that the three thermophilic 
enzymes can function together to produce fructose from corn flour at a high temperature. 

Example 18 
Amylase Flour with Isomerase 

In another example, amylase flour was mixed with purified MalA and each of 
twobacterial xylose isomerases: XylA of T. maritima, and an enzyme designated 
BD8037obtained from Diversa. Amylase flour was prepared as described in Example 18. 

S. solfataricus MalA with a 6His purification tag was expressed in E. coli. Cell lysate was 
prepared as described in Example 18, then purified to apparent homogeneity using a nickel 
affinity resin (Probond, Invitrogen) and following the manufacturer's instructions for native 
protein purification. 

T. maritima XylA with the addition of an S tag and an ER retention signal was expressed 
in E. coli and prepared in the same manner as the T. neapolitana XylA described in Example 18. 

Xylose isomerase BD8037 was obtained as a lyophilized powder and resuspended in 
0.4x the original volume of water. 
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Amylase corn flour was mixed with enzyme solutions plus water or buffer. All reactions 
contained 60 mg amylase flour and a total of 600^il of liquid. One set of reactions was buffered 
with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 1 mM CoC12; in a 
second set of reactions the metal-containing buffer solution was replaced by water. Isomerase 
enzyme amounts were varied as indicated in Table 4. All reactions were incubated for 2 hours at 
90°C. Reaction supernatant fractions were prepared by centrifugation. The pellets were washed 
with an additional 600^1 H 2 0 and recentrifuged. The supernatant fractions from each reaction 
were combined, filtered through a Centricon 10, and analyzed by HPLC with ELSD detection as 
described in Example 17. The amounts of glucose and fructose observed are graphed in Figure 
15. 



Table 4 



Sample 


Amylase flour 


Mai A 


Isomerase 


1 


60 mg 


+ 


none 


2 


60 mg 


+ 


T. maritima, 100pJ 


3 


60 mg 


+ 


T. maritima, 10 |J.l 


4 


60 mg 


+ 


T. maritima, 2\x\ 


5 


60 mg 


+ 


BD8037, 100^1 


7 


60mg 


+ 


BD8037, 2\x\ 


C 


60 mg 


none 


none 



With each of the isomerases, fructose was produced from corn flour in a dose-dependent 
manner when a-amylase and ot-glucosidase were present in the reaction. These results 
demonstrate that the grain-expressed amylase 797GL3 can function with MalA and a variety of 
different thermophilic isomerases, with or without added metal ions, to produce fructose from 
corn flour at a high temperature. In the presence of added divalent metal ions, the isomerases 
can achieve the predicted fructose: glucose equilibrium at 90°C of approximately 55% fructose. 
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This would be an improvement over the current process using mesophilic isomerases, which 
requires a chromatographic separation to increase the fructose concentration. 

Example 19 
Expression of a pullulanase in corn 

Transgenic plants that were homozygous for either pNOV7013 or pNOV7005 were 
crossed to generate transgenic corn seed expressing both the 797GL3 a-amylase and 6GP3 
pullulanase. 

Tl or T2 seed from self-pollinated maize plants transformed with either pNOV 7005 or 
pNOV 4093 were obtained. pNOV4093 is a fusion of the maize optimized synthetic gene for 
6GP3 (SEQ ID: 3,4) with the amyloplast targeting sequence (SEQ ID NO: 7,8) for localization 
of the fusion protein to the amyloplast. This fusion protein is under the control of the ADPgpp 
promoter (SEQ ED NO: 1 1) for expression specifically in the endosperm. The pNOV7005 
construct targets the expression of the pullulanase in the endoplasmic reticulum of the 
endosperm. Localization of this enzyme in the ER allows normal accumulation of the starch in 
the kernels. Normal staining for starch with an iodine solution was also observed, prior to any 
exposure to high temperature. 

As described in the case of a-amylase the expression of pullulanase targeted to the 
amyloplast (pNOV4093) resulted in abnormal starch accumulation in the kernels. When the 
corn-ears are dried, the kernels shriveled up. Apparently, this thermophilic pullulanase is 
sufficiently active at low temperatures and hydrolyzes starch if allowed to be in direct contact 
with the starch granules in the seed endosperm. 

Enzyme preparation or extraction of the enzyme from corn-flour : The pullulanase 
enzyme was extracted from the transgenic seeds by grinding them in Kleco grinder, followed by 
incubation of the flour in 50mM NaOAc pH 5.5 buffer for 1 hr at RT, with continuous shaking. 
The incubated mixture was then spun for 15min. at 14000 rpm. The supernatant was used as 
enzyme source. 

Pullulanase assay : The assay reaction was carried out in 96-well plate. The enzyme 
extracted from the corn flour (100 /xl) was diluted 10 fold with 900 /xl of 50mM NaOAc pH5.5 
buffer, containing 40 mM CaCl 2 . The mixture was vortexed, 1 tablet of Limit-Dextrizyme 
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(azurine-crosslinked-pullulan, from Megazyme) was added to each reaction mixture and 
incubated at 75 °C for 30 min (or as mentioned). At the end of the incubation the reaction 
mixtures were spun at 3500 rpm for 15 min. The supernatants were diluted 5 fold and 
transferred into 96-well flat bottom plate for absorbance measurement at 590 nm. Hydrolysis of 
azurine-crosslinked-pullulan substrate by the pullulanase produces water-soluble dye fragments 
and the rate of release of these (measured as the increase in absorbance at 590 nm) is related 
directly to enzyme activity. 

Figure 9 shows the analysis of T2 seeds from different events transformed with pNOV 
7005. High expression of pullulanase activity, compared to the non-transgenic control, can be 
detected in a number of events. 

To a measured amount (-100 fig) of dry corn flour from transgenic (expressing 
pullulanase, or amylase or both the enzymes) and / or control (non-transgenic) 1000 /xl of 50 mM 
NaOAc pH 5.5 buffer containing 40 mM CaCl 2 was added. The reaction mixtures were vortexed 
and incubated on a shaker for 1 hr. The enzymatic reaction was started by transferring the 
incubation mixtures to high temperature (75 °C, the optimum reaction temperature for 
pullulanase or as mentioned in the figures) for a period of time as indicated in the figures. The 
reactions were stopped by cooling them down on ice. The reaction mixtures were then 
centrifuged for 10 min. at 14000 rpm. An aliquot (100 fil) of the supernatant was diluted three 
fold, filtered through 0.2-micron filter for HPLC analysis. 

The samples were analyzed by HPLC using the following conditions: 

Column: Alltech Prevail Carbohydrate ES 5 micron 250 X 4.6 mm 

Detector: Alltech ELSD 2000 

Pump: Gilson 322 

Injector: Gilson 215 injector/diluter 

Solvents: HPLC grade Acetonitrile (Fisher Scientific) and Water (purified by Waters 
Millipore System) 
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Gradient used for oligosaccharides of low degree of polymerization (DP 1-15). 

Time %Water %Acetonitrile 

0 15 85 

5 15 85 

25 50 50 

35 50 50 

36 80 20 

55 80 20 

56 15 85 
76 15 85 

Gradient used for saccharides of high degree of polymerization (DP 20- 100 and above). 

Time %Water %Acetonitrile 

0 35 65 

60 85 15 

70 85 15 

85 35 65 

100 35 65 

System used for data analysis: Gilson Unipoint Software System Version 3.2 

Figures 10A and 10B show the HPLC analysis of the hydrolytic products generated by 
expressed pullulanase from starch in the transgenic corn flour. Incubation of the flour of 
pullulanase expressing corn in reaction buffer at 75 °C for 30 minutes results in production of 
medium chain oligosaccharides (DP -10-30) and short amylose chains (DP ~ 100 -200) from 
cornstarch. This figure also shows the dependence of pullulanase activity on presence of 
calcium ions. 

Transgenic corn expressing pullulanase can be used to produce modified-starch/dextrin 
that is debranched (al-6 linkages cleaved) and hence will have high level of amylose/straight 
chain dextrin. Also depending on the kind of starch (e.g. waxy, high amylose etc.) used the 
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chain length distribution of the amylose/dextrin generated by the pullulanase will vary, and so 
will the property of the modified-starch/dextrin. 

Hydrolysis of a 1-6 linkage was also demonstrated using pullulan as the substrate. The 
pullulanase isolated from corn flour efficiently hydrolyzed pullulan. HPLC analysis (as 
described) of the product generated at the end of incubation showed production of maltotriose, as 
expected, due to the hydrolysis of the a 1-6 linkages in the pullulan molecules by the enzyme 
from the corn. 

Example 20 

Expression of pullulanase in corn 

Expression of the 6gp3 pullulanase was further analyzed by extraction from corn flour 
followed by PAGE and Coomassie staining. Corn-flour was made by grinding seeds, for 30 sec, 
in the Kleco grinder. The enzyme was extracted from about 150mg of flour with 1ml of 50mM 
NaOAc pH 5.5 buffer. The mixture was vortexed and incubated on a shaker at RT for lhr, 
followed by another 15 min incubation at 70 °C. The mixture was then spun down (14000 rpm 
for 15 min at RT) and the supernatant was used as SDS-PAGE analysis. A protein band of the 
appropriate molecular weight (95 kDal) was observed. These samples are subjected to a 
pullulanase assay using commercially available dye-conjugated limit-dextrins (LIMIT- 
DEXTRIZYME, from Megazyme, Ireland). High levels of thermophilic pullulanase activity 
correlated with the presence of the 95 kD protein. 

The Western blot and ELISA analysis of the transgenic corn seed also demonstrated the 
expression of -95 kD protein that reacted with antibody produced against the pullulanase 
(expressed in E. coli). 

Example 21 

Increase in the rate of starch hydrolysis and improved yield 
of small chain (fermentable) oligosaccharides 
by the addition of pullulanase expressing corn 
The data shown in Figures 1 1 A and 1 IB was generated from HPLC analysis, as 
described above, of the starch hydrolysis products from two reaction mixtures. The first reaction 
indicated as 'Amylase' contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase 
expressing transgenic corn made according to the method described in Example 4, for example, 
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and non-transgenic corn A188; and the second reaction mixture 'Amylase + Pullulanase' 
contains a mixture [1:1 (w/w)] of corn flour samples of a -amylase expressing transgenic corn 
and pullulanase expressing transgenic corn made according to the method described in Example 
19. The results obtained support the benefit of use of pullulanase in combination with a-amylase 
during the starch hydrolysis processes. The benefits are from the increased rate of starch 
hydrolysis (Figure 1 1 A) and increase yield of fermentable oligosaccharides with low DP (Figure 
11B). 

It was found that a-amylase alone or a-amylase and pullulanase (or any other 
combination of starch hydrolytic enzymes) expressed in corn can be used to produce 
maltodextrin (straight or branched oligosaccharides) (Figures 1 1 A, 1 IB, 12, and 13 A). 
Depending on the reaction conditions, the type of hydrolytic enzymes and their combinations, 
and the type of starch used the composition of the maltodextrins produced, and hence their 
properties, will vary. 

Figure 12 depicts the results of an experiment carried out in a similar manner as 
described for Figure 1 1 . The different temperature and time schemes followed during incubation 
of the reactions are indicated in the figure. The optimum reaction temperature for pullulanase is 
75 °C and for a -amylase it is >95 °C. Hence, the indicated schemes were followed to provide 
scope to carry out catalysis by the pullulanase and/or the a? -amylase at their respective optimum 
reaction temperature. It can be clearly deduced from the result shown that combination of a- 
amylase and pullulanase performed better in hydrolyzing cornstarch at the end of 60 min 
incubation period. 

HPLC analysis, as described above (except -150 mg of corn flour was used in these 

reactions), of the starch hydrolysis product from two sets of reaction mixtures at the end of 30 

min incubation is shown in Figure 13A and 13B. The first set of reactions was incubated at 85 °C 

and the second one was incubated at 95 °C. For each set there are two reaction mixtures; the first 

reaction indicated as 'Amylase X Pullulanase' contains flour from transgenic corn (generated by 

cross pollination) expressing both the a-amylase and the pullulanase, and the second reaction 

indicated as 'Amylase' mixture of corn flour samples of a-amylase expressing transgenic corn 

and non-transgenic corn A188 in a ratio so as to obtain same amount of a -amylase activity as is 

observed in the cross (Amylase X Pullulanase). The total yield of low DP oligosaccharides was 
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more in case of a-amylase and pullulanase cross compared to corn expressing a -amylase alone, 
when the corn flour samples were incubated at 85 °C. The incubation temperature of 95 °C 
inactivates (at least partially) the pullulanase enzyme, hence little difference can be observed 
between 'Amylase X Pullulanase' and 'Amylase', However, the data for both the incubation 
temperatures shows significant improvement in the amount of glucose produced (Figure 13B), at 
the end of the incubation period, when corn flour of a -amylase and pullulanase cross was used 
compared to corn expressing a -amylase alone. Hence use of corn expressing both a -amylase 
and pullulanase can be especially beneficial for the processes where complete hydrolysis of 
starch to glucose is important. 

The above examples provide ample support that pullulanase expressed in corn seeds, 
when used in combination with a-amylase, improves the starch hydrolysis process. Pullulanase 
enzyme activity, being a 1-6 linkage specific, debranches starch far more efficiently than a - 
amylase (an a -1-4 linkage specific enzyme) thereby reducing the amount of branched 
oligosaccharides {e.g. limit-dextrin, panose; these are usually non-fermentable) and increasing 
the amount of straight chain short oligosaccharides (easily fermentable to ethanol etc.). 
Secondly, fragmentation of starch molecules by pullulanase catalyzed debranching increases 
substrate accessibility for the a-amylase, hence an increase in the efficiency of the a -amylase 
catalyzed reaction results. 

Example 22 

To determine whether the 797GL3 alpha amylase and malA alpha-glucosidase could 
function under similar pH and temperature conditions to generate an increased amount of 
glucose over that produced by either enzyme alone, approximately 0.35 ug of malA alpha 
glucosidase enzyme (produced in bacteria) was added to a solution containing 1% starch and 
starch purified from either non-transgenic corn seed (control) or 797GL3 transgenic corn seed (in 
797GL3 corn seed the alpha amylase co-purifies with the starch). In addition, the purified starch 
from non-transgenic and 797GL3 transgenic corn seed was added to 1% corn starch in the 
absence of any malA enzyme. The mixtures were incubated at 90°C, pH 6.0 for 1 hour, spun 
down to remove any insoluble material, and the soluble fraction was analyzed by HPLC for 
glucose levels. As shown in Figure 14, the 797GL3 alpha-amylase and malA alpha-glucosidase 
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function at a similar pH and temperature to break down starch into glucose. The amount of 
glucose generated is significantly higher than that produced by either enzyme alone. 

Example 23 

The utility of the Thermoanaerobacterium glucoamylase for raw starch hydrolysis was 
determined. As set forth in Figure 15, the hydrolysis conversion of raw starch was tested with 
water, barley a-amylase (commercial preparation from Sigma), Thermoanaerobacterum 
glucoamylase, and combinations thereof were ascertained at room temperature and at 30°C. As 
shown, the combination of the barley a-amylase with the Thermoanaerobacterium glucoamylase 
was able to hydrolyze raw starch into glucose. Moreover, the amount of glucose produced by the 
barley amylase and thermoanaerobacter GA is significantly higher than that produced by either 
enzyme alone. 

Example 24 

Maize-optimized genes and sequences for raw-starch hydrolysis and 

vectors for plant transformation 
The enzymes were selected based on their ability to hydrolyze raw-starch at temperatures 
ranging from approximately 20°-50°C. The corresponding genes or gene fragments were then 
designed by using maize preferred codons for the construction of synthetic genes as set forth in 
Example 1. 

Aspergillus shirousami a-amylase/glucoamylase fusion polypeptide (without signal 
sequence) was selected and has the amino acid sequence as set forth in SEQ ID NO: 45 as 
identified in Biosci. Biotech. Biochem., 56:884-889 (1992); Agric. Biol. Chem. 545:1905-14 
(1990); Biosci. Biotechnol. Biochem. 56:174-79 (1992). The maize-optimized nucleic acid was 
designed and is represented in SEQ ID NO:46. 

Similarly, Thermoanaerobacterium thermosaccharolyticum glucoamylase was selected, 
having the amino acid of SEQ ID NO:47 as published in Biosci. Biotech. Biochem., 62:302-308 
(1998), was selected. The maize-optimized nucleic acid was designed (SEQ ED NO: 48). 
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Rhizopus oryzae glucoamylase was selected having the amino acid sequence (without 
signal sequence)(SEQ ID NO: 50), as described in the literature (Agric. Biol. Chem. (1986) 50, 
pg 957-964). The maize-optimized nucleic acid was designed and is represented in SEQ ID 

NO:5L 

Moreover, the maize a-amylase was selected and the amino acid sequence (SEQ ED NO: 
51) and nucleic acid sequence (SEQ ED NO:52) were obtained from the literature. See, e.g., 
Plant Physiol. 105:759-760(1994). 

Expression cassettes are constructed to express the Aspergillus shirousami ot- 
amylase/glucoamylase fusion polypeptide from the maize-optimized nucleic acid was designed 
as represented in SEQ ED NO:46, the Thermoanaerobacterium thermosaccharolyticum 
glucoamylase from the maize-optimized nucleic acid was designed as represented in SEQ ID 
NO: 48, the Rhizopus oryzae glucoamylase was selected having the amino acid sequence 
(without signal sequence)(SEQ ED NO: 49) from the maize-optimized nucleic acid was designed 
and is represented in SEQ ED NO:50, and the maize a-amylase. 

A plasmid comprising the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) is fused to the synthetic gene encoding the 
enzyme. Optionally, the sequence SEKDEL is fused to the C-terminal of the synthetic gene for 
targeting to and retention in the ER. The fusion is cloned behined the maize y-zein promoter for 
expression specifically in the endosperm in a plant transformation plasmid. The fusion is 
delivered to the corn tissue via Agrobacterium transfection. 

Example 25 

Expression cassettes comprising the selected enzymes are constructed to express the 
enzymes. A plasmid comprising the sequence for a raw starch binding site is fused to the 
synthetic gene encoding the enzyme. The raw starch binding site allows the enzyme fusion to 
bind to non-gelatinized starch. The raw-starch binding site amino acid sequence (SEQ ED 
NO:53) was determined based on literature, and the nucleic acid sequence was maize-optimized 
to give SEQ ID NO: 54. The maize-optimized nucleic acid sequence is fused to the synthetic 
gene encoding the enzyme in a plasmid for expression in a plant. 
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Example 26 

Construction of maize-optimized genes and vectors for plant transformation 
The genes or gene fragments were designed by using maize preferred codons for the 

construction of synthetic genes as set forth in Example 1. 

Pyrococcus furiosus EGLA, hyperthermophilic endoglucanase amino acid sequence 

(without signal sequence) was selected and has the amino acid sequence as set forth in SEQ ID 

NO: 55, as identified in Journal of Bacteriology (1999) 181, pg 284-290.) The maize-optimized 

nucleic acid was designed and is represented in SEQ ID NO:56. 

Thermus flavus xylose isomerase was selected and has the amino acid sequence as set 

forth in SEQ ID NO:57, as described in Applied Biochemistry and Biotechnology 62:15-27 

(1997). 

Expression cassettes are constructed to express the Pyrococcus furiosus EGLA 
(endoglucanase) from the maize-optimized nucleic acid (SEQ ID NO: 56) and the Thermus flavus 
xylose isomerase from a maize-optimized nucleic acid encoding amino acid sequence SEQ ID 
NO:57 A plasmid comprising the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) is fused to the synthetic maize-optimized 
gene encoding the enzyme. Optionally, the sequence SEKDEL is fused to the C-terminal of the 
synthetic gene for targeting to and retention in the ER. The fusion is cloned behined the maize y- 
zein promoter for expression specifically in the endosperm in a plant transformation plasmid. 
The fusion is delivered to the corn tissue via Agrobacterium transfection. 

Example 27 

Production of elucose from corn flour using thermophilic enzymes expressed in corn 

Expression of the hyperthermophilic ot-amylase, 797GL3 and oc-glucosidase (Mai A) were 
shown to result in production of glucose when mixed with an aqueous solution and incubated at 
90 °C 

A transgenic corn line (line 168A10B, pNOV4831) expressing MalA enzyme was 
identified by measuring ot-glucosidase activity as indicated by hydrolysis of p-nitrophenyl-a- 
glucoside. 
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Corn kernels from transgenic plants expressing 797GL3 were ground to a flour in a Kleco 
cell thus creating amylase flour. Corn kernels from transgenic plants expressing MalA were 
ground to a flour in a Kleco cell thus creating MalA flour Non-transgenic corn kernels were 
ground in the same manner to generate control flour. 

Buffer was 50 mM MES buffer pH 6.0. 
Corn flour hydrolysis reactions : Samples were prepared as indicated in Table 5 below. Corn 
flour (about 60 mg per sample) was mixed with 40 ml of 50 mM MES buffer, pH 6.0. Samples 
were incubated in a water bath set at 90°C for 2.5 and 14 hours. At the indicated incubation 
times, samples were removed and analyzed for glucose content. 

The samples were assayed for glucose by a glucose oxidase / horse radish peroxidase 
based assay. GOPOD reagent contained: 0.2 mg/ml o-dianisidine, 100 mM Tris pH 7.5 , 100 
U/ml glucose oxidase & 10 U/ml horse radish peroxidase. 20 ^il of sample or diluted sample 
were arrayed in a 96 well plate along with glucose standards (which varied from 0 to 0.22 
mg/ml). 100 \x\ of GOPOD reagent was added to each well with mixing and the plate incubated 
at 37 °C for 30 min. 100 \i\ of sulfuric acid (9M) was added and absorbance at 540 nm was read. 
The glucose concentration of the samples was determined by reference to the standard curve. 
The quantity of glucose observed in each sample is indicated in Table 5. 
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These data demonstrate that when expression of hyperthermophilic a-amylase and a- 
glucosidase in corn result in a corn product that will generate glucose when hydrated and heated 
under appropriate conditions. 

Example 28 

Production of Maltodextrins 
Grain expressing thermophilic a-amylase was used to prepare maltodextrins. The 
exemplified process does not require prior isolation of the starch nor does it require addition of 
exogenous enzymes. 

Corn kernels from transgenic plants expressing 797GL3 were ground to a flour in a Kleco 
cell to create "amylase flour". A mixture of 10% transgenic/90% non-transgenic kernels was 
ground in the same manner to create "10% amylase flour." 

Amylase flour and 10% amylase flour (approximately 60 mg/sample) were mixed with 
water at a rate of 5 |il of water per mg of flour. The resulting slurries were incubated at 90°C for 
up to 20 hours as indicated in Table 6. Reactions were stopped by addition of 0.9 ml of 50 mM 
EDTA at 85°C and mixed by pipetting. Samples of 0.2 ml of slurry were removed, centrifuged 
to remove insoluble material and diluted 3x in water. 

The samples were analyzed by HPLC with ELSD detection for sugars and maltodextrins. The 
gradient HPLC system was equipped with Astec Polymer Amino Column, 5 micron particle size, 
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250 X 4.6 mm and an Alltech ELSD 2000 detector. The system was pre-equilibrated with a 
15:85 mixture of watenacetonitrile. The flow rate was 1 ml/min. The initial conditions were 
maintained for 5 min after injection followed by a 20 min gradient to 50:50 watenacetonitrile 
followed by 10 minutes of the same solvent. The system was washed with 20 min of 80:20 
water: acetonitrile and then re-equilibrated with the starting solvent. 

The resulting peak areas were normalized for volume and weight of flour. The response 
factor of ELSD per |ig of carbohydrate decreases with increasing DP, thus the higher DP 
maltodextrins represent a higher percentage of the total than indicated by peak area. 

The relative peak areas of the products of reactions with 100% amylase flour are shown 
in Figure 17. The relative peak areas of the products of reactions with 10% amylase flour are 
shown in Figure 18. 

These data demonstrate that a variety of maltodextrin mixtures can be produced by 
varying the time of heating. The level of ct-amylase activity can be varied by mixing transgenic 
ot-amylase-expressing corn with wild-type corn to alter the maltodextrin profile. 

The products of the hydrolysis reactions described in this example can be concentrated 
and purified for food and other applications by use of a variety of well defined methods 
including: centrifugation, filtration, ion-exchange, gel permeation, ultrafiltration, nano filtration, 
reverse osmosis, decolorizing with carbon particles, spray drying and other standard techniques 
known to the art. 

Example 29 

Effect of time and temperature on maltodextrin production 
The composition of the maltodextrin products of autohydrolysis of grain containing 
thermophilic a-amylase may be altered by varying the time and temperature of the reaction. 

In another experiment, amylase flour was produced as described in Example 28 above 
and mixed with water at a ratio of 300|il water per 60 mg flour. Samples were incubated at 70% 
80°, 90°, or 100° C for up to 90 minutes. Reactions were stopped by addition of 900ml of 50mM 
EDTA at 90°C, centrifuged to remove insoluble material and filtered through 0.45 |am nylon 
filters. Filtrates were analyzed by HPLC as described in Example 28. 
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The result of this analysis is presented in Figure 19. The DP number nomenclature refers 
to the degree of polymerization. DP2 is maltose; DP3 is maltotriose, etc. Larger DP 
maltodextrins eluted in a single peak near the end of the elution and are labeled ">DP12'\ This 
aggregate includes dextrins that passed through 0.45 |im filters and through the guard column 
and does not include any very large starch fragments trapped by the filter or guard column. 

This experiment demonstrates that the maltodextrin composition of the product can be 
altered by varying both temperature and incubation time to obtain the desired 
maltooligosaccharide or maltodextrin product. 

Example 30 

Maltodextrin production 

The composition of maltodextrin products from transgenic maize containing thermophilic 
a-amylase can also be altered by the addition of other enzymes such as a -glucosidase and xylose 
isomerase as well as by including salts in the aqueous flour mixture prior to treating with heat. 

In another, amylase flour, prepared as described above, was mixed with purified MalA 
and/or a bacterial xylose isomerase, designated BD8037. S. sulfotaricus MalA with a 6His 
purification tag was expressed in E. coli. Cell lysate was prepared as described in Example 28, 
then purified to apparent homogeneity using a nickel affinity resin (Probond, Invitrogen) and 
following the manufacturer's instructions for native protein purification. Xylose isomerase 
BD8037 was obtained as a lyophilized powder from Diversa and resuspended in 0.4x the original 
volume of water. 

Amylase corn flour was mixed with enzyme solutions plus water or buffer. All reactions 
contained 60 mg amylase flour and a total of 600|il of liquid. One set of reactions was buffered 
with 50 mM MOPS, pH 7.0 at room temperature, plus lOmM MgS04 and 1 mM CoCl 2 ; in a 
second set of reactions the metal-containing buffer solution was replaced by water. All reactions 
were incubated for 2 hours at 90°C. Reaction supernatant fractions were prepared by 
centrifugation. The pellets were washed with an additional 600|il H 2 0 and re-centrifuged. The 
supernatant fractions from each reaction were combined, filtered through a Centricon 10, and 
analyzed by HPLC with ELSD detection as described above. 
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The results are graphed in Figure 20. They demonstrate that the grain-expressed amylase 
797GL3 can function with other thermophilic enzymes, with or without added metal ions, to 
produce a variety of maltodextrin mixtures from corn flour at a high temperature. In particular, 
the inclusion of a glucoamylase or a-glucosidase may result in a product with more glucose and 
other low DP products. Inclusion of an enzyme with glucose isomerase activity results in a 
product that has fructose and thus would be sweeter than that produced by amylase alone or 
amylase with a-glucosidase. In addition the data indicate that the proportion of DPS, DP6 and 
DP7 maltooligosaccharides can be increased by including divalent cationic salts, such as CoCl 2 
and MgS0 4 . 

Other means of altering the maltodextrin composition produced by a reaction such as that 
described here include: varying the reaction pH, varying the starch type in the transgenic or non- 
transgenic grain, varying the solids ratio, or by addition of organic solvents. 

Example 31 

Preparing dextrins. or sugars from grain without mechanical 
disruption of the grain prior to recovery of starch-derived products 
Sugars and maltodextrins were prepared by contacting the transgenic grain expressing the 
a-amylase, 797GL3, with water and heating to 90°C overnight (>14 hours). Then the liquid was 
separated from the grain by filtration. The liquid product was analyzed by HPLC by the method 
described in Example 15. Table 6 presents the profile of products detected. 



110 



WO 2005/096804 



PCT/US2004/007182 



Table 6 
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* Quantification of DP3 includes maltotriose and may include isomers of maltotriose that 
have an cc(l-»6) bond in place of an cc(l->4) bond. Similarly DP4 to DP7 quantification 
includes the linear maltooligosaccarides of a given chain length as well as isomers that have 
one or more a(l~»6) bonds in place of one or more a(l->4) bonds 

These data demonstrate that sugars and maltodextrins can be prepared by contacting 
intact ct-amylase-expressing grain with water and heating. The products can then be separated 
from the intact grain by filtration or centrifugation or by gravitational settling. 

Example 32 

Fermentation of raw starch in corn expressing Rhizopus owzae glucoamylase. 
Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 29. The kernels are ground to a flour. The corn kernels express a protein that contains 
an active fragment of the glucoamylase of Rhizopus oryzae (Sequence ID NO: 49) targeted to the 

endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 15. Then a mash is 
prepared containing s 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
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components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86°C; at 48 hours it is set to 82 °C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 33 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The kernels are ground to a flour. The corn kernels express a protein that contains 
an active fragment of the glucoamylase of Rhizopus oryzae (Sequence ED NO: 49) targeted to the 
endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 15. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000- fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow CO2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 34 

Example of fermentation of raw starch in whole kernels of corn expressing 
Rhizopus oryzae elucoamylase with addition of exogenous q-amylase 
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Transgenic corn kernels are harvested from transgenic plants" made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Rhizopus oryzae (Sequence ED NO: 49) targeted to the endoplasmic reticulum. 

The corn kernels are contacted with 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml 
of backset (8% solids by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. 
The following components are added: barley ot-amylase purchased from Sigma (2 mg), protease 
(0.60 ml of a 1,000-fold dilution of a commercially available protease), 0.2 mg Lactocide & urea 
(0.85 ml of a 10-fold dilution of 50% Urea Liquor). A hole is cut into the cap of the 100 ml 
bottle containing the mixture in order to allow C0 2 to vent. The mixture is then inoculated with 
yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 hours of fermentation the 
temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 35 

Fermentation of raw starch in corn expressing Rhizopus oryzae 

glucoamylase and Zea mays amylase 
Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Rhizopus oryzae (Sequence ED NO:49) targeted to the endoplasmic reticulum. 
The kernels also express the maize amylase with raw starch binding domain as described in 
Example 28. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of com flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000- fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
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mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90 F. After 24 
hours of fermentation the temperature is lowered to 86 F; at 48 hours it is set to 82 F. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 36 

Example of fermentation of raw starch in corn expressing 
Thermoanaerohacter thermosaccharolvticum glucoamylase . 
Transgenic com kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerohacter thermosaccharolyticum (Sequence ID NO: 47) targeted to 

the endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 15. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 37 

Example of fermentation of raw starch in corn expressing 

Aspergillus niser glucoamylase 

Transgenic corn kernels are harvested from transgenic plants made as described in 

Example 28. The corn kernels express a protein that contains an active fragment of the 

glucoamylase of Aspergillus niger (Fiil,N.P. "Glucoamylases Gl and G2 from Aspergillus niger 
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are synthesized from two different but closely related mRNAs" EMBO J. 3 (5), 1097-1 102 
(1984), Accession number P04064). The maize-optimized nucleic acid encoding the 
glucoamylase has SEQ ID NO:59 and is targeted to the endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1.44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 38 

Example of fermentation of raw starch in corn expressing 
Aspergillus niger glucoamylase and Zea mays amylase 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Aspergillus niger (Fiil,N.P. "Glucoamylases Gl and G2 from Aspergillus niger 
are synthesized from two different but closely related mRNAs" EMBO J. 3 (5), 1097-1 102 
(1984) : Accession number P04064)(SEQ ID NO:59, maize-optimized nucleic acid) and is 
targeted to the endoplasmic reticulum. The kernels also express the maize amylase with raw 
starch binding domain as described in example 28. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
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A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 39 

Example of fermentation of raw starch in corn expressing 
Thermoanaerobacter thermosaccharolvticum glucoamvlase and barley amylase 

Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerobacter thermosaccharolyticum (Sequence ID NO: 47) targeted to 
the endoplasmic reticulum. The kernels also express the low pi barley amylase amyl gene 
(RogersJ.C. and Milliman,C. "Isolation and sequence analysis of a barley alpha-amylase cDNA 
clone" J. Biol. Chem. 258 (13), 8169-8174 (1983) modified to target expression of the protein to 
the endoplasmic reticulum. 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 
prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C. After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 
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Example 40 

Example of fermentation of raw starch in whole kernals of corn expressing Thermoanaerobacter 

thermosaccharolyticum glucoamylase and barley amylase. 
Transgenic corn kernels are harvested from transgenic plants made as described in 
Example 28. The corn kernels express a protein that contains an active fragment of the 
glucoamylase of Thermoanaerobacter thermosaccharolyticum (Sequence ID NO: 47) targeted to 
the endoplasmic reticulum. The kernels also express the low pi barley amylase amyl gene 
(RogersJ.C. and Milliman,C. "Isolation and sequence analysis of a barley alpha-amylase cDNA 
clone" J. Biol. Chem. 258 (13), 8169-8174 (1983) modified to target expression of the protein to 
the endoplasmic reticulum. 

The corn kernels are contacted with 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml 
of backset (8% solids by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. 
The following components are added to the mixture: protease (0.60 ml of a 1,000-fold dilution 
of a commercially available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10-fold dilution of 
50% Urea Liquor). A hole is cut into the cap of the 100 ml bottle containing the mash to allow 
C0 2 to vent. The mixture is then inoculated with yeast (1.44 ml) and incubated in a water bath 
set at 90° C. After 24 hours of fermentation the temperature is lowered to 86° C; at 48 hours it is 
set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 
Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 41 

Example of fermentation of raw starch in com expressing an 

alpha-amylase and glucoamylase fusion. 

Transgenic corn kernels are harvested from transgenic plants made as described in 

Example 28. The corn kernels express a maize-optimized polynucleotide such as provided in 

SEQ ID NO: 46, encoding an alpha-amylase and glucoamylase fusion, such as provided in SEQ 

ED NO: 45, which are targeted to the endoplasmic reticulum. . 

The corn kernels are ground to a flour as described in Example 14. Then a mash is 

prepared containing 20 g of corn flour, 23 ml of de-ionized water, 6.0 ml of backset (8% solids 
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by weight). pH is adjusted to 6.0 by addition of ammonium hydroxide. The following 
components are added to the mash: protease (0.60 ml of a 1,000-fold dilution of a commercially 
available protease), 0.2 mg Lactocide & urea (0.85 ml of a 10- fold dilution of 50% Urea Liquor). 
A hole is cut into the cap of the 100 ml bottle containing the mash to allow C0 2 to vent. The 
mash is then inoculated with yeast (1 .44 ml) and incubated in a water bath set at 90° C After 24 
hours of fermentation the temperature is lowered to 86° C; at 48 hours it is set to 82° C. 

Yeast for inoculation is propagated as described in Example 14. 

Samples are removed as described in example 14 and then analyzed by the methods 
described in Example 14. 

Example 42 

Construction of transformation vectors 

Expression cassettes were constructed to express the hyperthermophilic beta-glucanase 
EglA in maize as follows: 

pNOV4800 comprises the barley Amy32b signal peptide 

(MGKNGNLCCFSLLLLLLAGLASGHQ) fused to the synthetic gene for the EglA beta- 
glucanase for targeting to the endoplasmic reticulum and secretion into the apoplast. The fusion 
was cloned behind the maize y-zein promoter for expression specifically in the endosperm. 

pNOV4803 comprises the barley Amy32b signal peptide fused to the synthetic gene for the 
EglA beta-glucanase for targeting to the endoplasmic reticulum and secretion into the apoplast. 
The fusion was cloned behind the maize ubiquitin promoter for expression throughout the plant. 

Expression cassettes were constructed to express the thermophilic beta-glucanase/mannanase 
6GP1 (SEQ ED NO: 85) in maize as follows: 



118 



WO 2005/096804 



PCT/US2004/007182 



pNOV4819 comprises the tobacco PR la signal peptide 

(MGFVLFSQLPSFLLVSTLLLFLVISHSCRA) fused to the synthetic gene for the 6GP1 beta- 
glucanase/mannanase for targeting to the endoplasmic reticulum and secretion into the apoplast. 
The fusion was cloned behind the maize y-zein promoter for expression specifically in the 
endosperm. 

pNOV4820 comprises the synthetic gene for 6GP1 cloned behind the maize y-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

pNOV4823 comprises the tobacco PR la signal peptide fused to the synthetic gene for the 6GP1 
beta-glucanase/mannanase with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter 
for expression specifically in the endosperm. 

pNOV4825 comprises the tobacco PR la signal peptide fused to the synthetic gene for the 6GP1 
beta-glucanase/mannanase with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum. The fusion was cloned behind the maize ubiquitin 
promoter for expression throughout the plant. 

Expression cassettes were constructed to express the barley Amyl alpha-amylase (SEQ ID NO: 
87) in maize as follows: 

pNOV4867 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase with a C-terminal addition of the sequence SEKDEL for targeting to and retention 
in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter for 
expression specifically in the endosperm. 

pNOV4879 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase with a C-terminal addition of the sequence SEKDEL for targeting to and retention 
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in the endoplasmic reticulum. The fusion was cloned behind the maize globulin promoter for 
expression specifically in the embryo. 

pNOV4897 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase for targeting to the endoplasmic reticulum and secretion into the apoplast. The 
fusion was cloned behind the maize globulin promoter for expression specifically in the embryo. 

pNOV4895 comprises the maize y-zein N-terminal signal sequence fused to the barley Amyl 
alpha-amylase for targeting to the endoplasmic reticulum and secretion into the apoplast. The 
fusion was cloned behind the maize y-zein promoter for expression specifically in the endosperm 

pNOV4901 comprises the gene for the barley Amyl alpha-amylase cloned behind the maize 
globulin promoter for cytoplasmic localization and expression specifically in the embryo. 

Expression cassettes were constructed to express the Rhizopus glucoamylase (SEQ ID NO: 50) 
in maize as follows: 

pNOV4872 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase with a C-terminal addition of the sequence SEKDEL for targeting to 
and retention in the endoplasmic reticulum. The fusion was cloned behind the maize y-zein 
promoter for expression specifically in the endosperm. 

pNOV4880 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase with a C-terminal addition of the sequence SEKDEL for targeting to 
and retention in the endoplasmic reticulum. The fusion was cloned behind the maize globulin 
promoter for expression specifically in the embryo. 

pNOV4889 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase for targeting to the endoplasmic reticulum and secretion into the 
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apoplast. The fusion was cloned behind the maize globulin promoter for expression specifically 
in the embryo. 

pNOV4890 comprises the maize y-zein N-terminal signal sequence fused to the synthetic gene 
for Rhizopus glucoamylase for targeting to the endoplasmic reticulum and secretion into the 
apoplast. The fusion was cloned behind the maize y-zein promoter for expression specifically in 
the endosperm. 

pNOV4891 comprises the synthetic gene for Rhizopus glucoamylase cloned behind the maize y- 
zein promoter for cytoplasmic localization and expression specifically in the endosperm. 

Example 43 

Expression of the mesophilic Rhizopus glucoamylase in corn 

A variety of constructs were generated for the expression of the Rhizopus glucoamylase 
in corn. The maize y-zein and globulin promoters were used to express the glucoamylase 
specifically in the endosperm or embryo, respectively. In addition, the maize y-zein signal 
sequence and a synthetic ER retention signal were used to regulate the subcellular 
localization of the glucoamylase protein. All 5 constructs (pNOV4872, pNOV4880, 
pNOV4889, pNOV4890, and pNOV4891) yielded transgenic plants with glucoamylase 
activity detected in the seed. Tables 7 and 8 show the results for individual transgenic seed 
(construct pNOV4872) and pooled seed (construct pNOV4889), respectively. No 
detrimental phenotype was observed for any transgenic plants expressing this Rhizopus 
glucoamylase. 

Glucoamylase assay: Seed were ground to a flour and the flour was suspended in water. 
The samples were incubated at 30 degrees for 50 minutes to allow the glucoamylase to react 
with the starch. The insoluble material was pelleted and the glucose concentration was 
determined for the supernatants. The amount of glucose liberated in each sample was taken 
as an indication of the level of glucoamylase present. Glucose concentration was determined 
by incubating the samples with GOHOD reagent (300mM Tris/Cl pH7.5, glucose oxidase 
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(20U/ml), horseradish peroxidase (20U/ml), o-dianisidine 0.1 mg/ml) for 30 minutes at 37 
degrees C, adding 0.5 volumes of 12N H2S04, and measuring the OD540. 

Table 7 shows activity of the Rhizopus glucoamylase in individual transgenic corn seed 
(construct pNOV4872). 



Table 7 






U/g 


Seed 


flour 


Wild Type #1 


0.07 


Wild Type #2 


0.55 


Wild Type #3 


0.25 


Wild Type #4 


0.33 


Wild Type #5 


0.30 


Wild Type #6 


0.42 


Wild Type #7 


-0.01 


Wild Type #8 


0.31 


MD9L022156 #1 


5.17 


MD9L022156 #2 


1.66 


MD9L022156 #3 


7.66 


MD9L022156#4 


1.77 


MD9L022156 #5 


7.08 


MD9L0221 56 #6 


4.46 


MD9L0221 56 #7 


2.20 


MD9L022156 #8 


3.50 


MD9L023377 #1 


9.23 


MD9L023377 #2 


4.30 


MD9L023377 #3 


6.72 


MD9L023377 #4 


3.35 


MD9L023377 #5 


0.56 


MD9L023377 #6 


4.79 


MD9L023377 #7 


4.60 


MD9L023377 #8 


6.01 


MD9L023043 #1 


4.93 


MD9L023043 #2 


8.74 


MD9L023043 #3 


2.70 


MD9L023043 #4 


0.72 


MD9L023043 #5 


3.33 


MD9L023043 #6 


3.53 


MD9L023043 #7 


3.94 


MD9L023043 #8 


11.51 
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*L M >Pa*v aPa, P ^aa. _pa— . a™^ aPa A 

MD9L023334 


#1 


4.28 


MD9L023334 


#2 


2.86 


Am A ama^k a***. ■ *Pa a*™k. ^Pfc _| 

MD9L023334 


#3 


0.56 


A M Mhk a*W A A ,a*%, .a**. ^P- al 

MD9L023334 


#4 


6.96 


■k A 9aW am a aa — pav a\ #% _| 

MD9L023334 


#5 


3.29 


A> A a*a» • *P*v A -aw _aw A a 

MD9L023334 


#6 


3.18 


a> a aaaav aav ■ am— —Pa_ jia y»v ^"V aS 

MD9L023334 


#7 


4.57 


a> a aaa. aPa. A -aw aaW _aa Jfc 

MD9L023334 


#8 


7.44 


Pi a aav ala. a a aa a% Jlh a\ M 

MD9L022039 


#1 


_-a* A 

6.25 


at a aia _n _ a a> aaav. aaw am— appk aPak 

MD9L022039 


#2 


j*** ^p_ *" 

2.85 


MD9L022039 


#3 


4.32 


MD9L022039 


#4 


2.51 


MD9L022039 


#5 


5.06 


MD9L022039 


#6 


5.03 


MD9L022039 


#7 


2.79 


MD9L022039 


#8 


2.98 



Table 8 shows activity of the Rhizopus glucoamylase in pooled transgenic corn seed 
(construct pNOV4889). 



Table 8 





U/g 


Seed 


flour 


Wild Type 


0.38 


MD9L023347 


2.14 


MD9L023352 


2.34 


MD9L023369 


1.66 


MD9L023469 


1.42 


MD9L023477 


1.33 


MD9L023482 


1.95 


MD9L023484 


1.32 


MD9L024170 


1.35 


MD9L024177 


1.48 


MD9L024184 


1.60 


MD9L024186 


1.34 


MD9L024196 


1.38 


MD9L024228 


1.69 


MD9L024263 


1.70 


MD9L024315 


1.32 
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MD9L024325 



MD9L024333 



MD9L024339 



1.73 
1.41 
1.84 



All expression cassettes were inserted into the binary vector pNOV21 17 for 
transformation into maize via Agrobacterium infection. The binary vector contained the 
phosphomannose isomerase (PMI) gene which allows for selection of transgenic cells with 
mannose. Transformed maize plants were either self-pollinated or outcrossed and seed was 
collected for analysis. 



Example 44 

Expression of the hyperthermophilic beta-glucanase EglA in corn 

For expression of the hyperthermophilic beta-glucanase EglA in corn we utilized the 
ubiquitin promoter for expression throughout the plant and the y-zein promoter for expression 
specifically in the endosperm of corn seed. The barley Amy32b signal peptide was fiised to 
EglA for localization in the apoplast. 

Expression of the hyperthermophilic beta-glucanase EglA in transgenic corn seed and leaves was 
analysed using an enzymatic assay and western blotting. 

Transgenic seed segregating for construct pNOV4800 or pNOV4803 were analysed using 

both western blotting and an enzymatic assay for beta-glucanase. Endosperm was isolated from 

individual seed after soaking in water for 48 hours. Protein was extracted by grinding the 

endosperm in 50mM NaP04 buffer (pH 6.0). Heat -stable proteins were isolated by heating the 

extracts at 100 degrees C for 15 minutes, followed by pelleting of the insoluble material. The 

supernatant containing heat-stable proteins was analysed for beta glucanase activity using the 

azo-barley glucan method (megazyme). Samples were pre-incubated at 100 degrees C for 10 

minutes and assayed for 10 minutes at 100 degrees C using the azo-barley glucan substrate. 

Following incubation, 3 volumes of precipitation solution were added to each sample, the 

samples were centrifuged for 1 minute, and the OD590 of each supernatant was determined. In 

addition, 5ug of protein were separated by SDS-PAGE and blotted to nitrocellulose for western 
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blot analysis using antibodies against the EglA protein. Western blot analysis detected a 
specific, heat-stable protein(s) in the EglA positive endosperm extracts, and not in negative 
extracts. The western blot signal correlates with the level of EglA activity detected 
enzymatically. 

EglA activity was analysed in leaves and seed of plants containing the transgenic 
constructs pNOV4803 and pNOV4800, respectively. The assays (conducted as described above) 
showed that the heat-stable beta-glucanase EglA was expressed at various levels in the leaves 
(Table 9) and seed (Table 10) of transgenic plants while no activity was detected in non- 
transgenic control plants. Expression of EglA in corn utilizing constructs pNOV4800 and 
pNOV4803 did not result in any detectable negative phenotype. 

Table 9 shows the activity of the hyperthermophilic beta-glucanase EglA in leaves of 
transgenic corn plants. Enzymatic assays were conducted on extracts from leaves of pNOV4803 
transgenic plants to detect hyperthermophilic beta-glucanase acitivity. Assays were conducted at 
1 00 degrees C using the azo-barley glucan method (megazyme). The results indicate that the 
transgenic leaves have varying levels of hyperthermophilic beta-glucanase activity. 



Table 9 




Plant 


Abs590 


Wild Type 


0 


266A-17D 


0.008 


266A-18E 


0.184 


266A-13C 


0.067 


266A-15E 


0.003 


266A-11E 


0 


265C-1B 


0.024 


265C-1C 


0.065 


265C-2D 


0.145 


265C-5C 


0.755 


265C-5D 


0.133 


265C-3A 


0.076 


266A-4B 


0.045 


266A-12B 


0.066 


266A-11C 


0.096 
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266A-14B 
266A-4C 
266A-4A 
266A-12A 
266A-15B 
266A-11A 
266A-20C 
266A-19D 
266A-12C 
266A-4E 
266A-18B 
265C-3D 
266A-20E 
266A-13D 
265C-3B 
266A-15A 
266A-13A 
265C-3E 
266A-20A 
266A-20B 
266A-19C 
266A-20D 
266A-4D 
266A-18A 
265C-5E 
266A-17E 
266A-11B 
265C-4E 
265C-4D 



0.074 
0.107 
0.084 
0.054 
0.052 
0.109 
0.044 
0.02 
0.098 
0.248 
0.367 
0.066 
0.163 
0.084 
0.065 
0.131 
0.169 
0.116 
0.365 
0.521 
0.641 
0.561 
0.363 
0.676 
0.339 
0.221 
0.251 
0.138 
0.242 
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Table 10 shows the activity of the hyperthermophilic beta-glucanase EglA in seed of 
transgenic corn plants. Enzymatic assays were conducted on extracts from individual, 
segregating seed of pNOV4800 transgenic plants to detect hyperthermophilic beta-glucanase 
acitivity. Assays were conducted at 100 degrees C using the azo-barley glucan method 
(megazyme). The results indicate that the transgenic seed have varying levels of 
hyperthermophilic beta-glucanase activity. 



Table 10 




Seed 


Abs 590 


Wild Type 


0 


1A 


1.1 


1B 


0 


1C 


1.124 


1D 


1.323 


2A 


0 


2B 


1.354 


2C 


1.307 


2D 


0 


3A 


0.276 


3B 


0.089 


3C 


0.463 


3D 


0 


4A 


0.026 


4B 


0.605 


4C 


0.599 


4D 


0.642 


5A 


1.152 


5B 


1.359 


5C 


1.035 


5D 


0 


6A 


0.006 


6B 


1.201 


6C 


0.034 


6D 


1.227 


7A 


0.465 


7B 


0 


7C 


0.366 


7D 


0.77 


8A 


1.494 


8B 


1.427 
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8C 0.003 
8D 1.413 
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Effect of transgenic expression of endoglucanase EglA on cell wall composition & in vitro 
digestibility analysis 

Five individual seed from each of two lines, #263 & #266, not expressing or expressing 
Egla (pNOV4803) respectively were grown in the greenhouse. Protein extracts made from small 
leaf samples from immature plants were used to verify that transgenic endoglucanase activity 
was present in #266 plants but not #263 plants. At full plant maturity, -30 days after pollination, 
the whole above ground plant was harvested, roughly chopped, and oven dried for 72 hours. 
Each sample was divided into 2 duplicate samples (labelled A & B respectively), and subjected 
to in vitro digestibility analysis using strained rumen fluid using common procedures (Forage 
fiber analysis apparatus, reagents, procedures, and some applications, by H. K. Goering and 
P. J. Van Soest, Goering, H.Keith 1941 (Washington, D.C.) : Agricultural Research Service, 
U.S. Dept. of Agriculture, 1970. iv, 20 p. : ill. Agriculture handbook ; no. 379 ), except that 
material was treated by a pre-incubation at either 40°C or 90°C prior to in vitro digestibility 
analysis. In vitro digestibility analysis was performed as follows: 

Samples were chopped to about 1mm with a wiley mill, and then sub-divided into 16 
weighed aliquots for analysis. Material was suspended in buffer and incubated at either 40°C or 
90°C for 2 hours, then cooled overnight. Micronutrients, trypticase & casein & sodium sulfite 
were added, followed by strained rumen fluid, and incubated for 30 hours at 37°C. Analyses of 
neutral detergent fiber (NDF), acid detergent fiber (ADF) and acid detergent lignin (AD-L) were 
performed using standard gravimeteric methods (Van Soest & Wine, Use of Detergents in the 
Analysis of fibrous Feeds. IV. Determination of plant cell-wall constituents. P.J. Van Soest & 
R.H. Wine. (1967). Journal of The AOAC, 50: 50-55; see also Methods for dietry fiber, neutral 
detergent fiber and nonstarch polysaccharides in relation to animal nutrition (1991). P.J. Van 
Soest, J.B. Robertson & B.A. Lewis. J. Dairy Science, 74: 3583-3597.). 

Data show that transgenic plants expressing EglA (#266) contain more NDF than control 
plants (#233), whilst ADF & lignin are relatively unchanged. The NDF fraction of transgenic 
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plants is more readily digested than that of non-transgenic plants, and this is due to an increase in 
the digestibility of cellulose (NDF - ADF - AD-L), consistent with "self-digestion" of the cell- 
wall cellulose by the transgenically expressed endoglucanase enzyme. 

Example 45 

Expression of the thermophilic beta-glucanase/mannanase (6GP1) in corn 

Transgenic seed for pNOV4820 and pNOV4823 were analysed for 6GP1 beta glucanase 
activity using the azo-barley glucan method (megazyme). Enzymatic assays conducted at 50 
degrees C indicate that the transgenic seed have thermophilic 6GP1 beta-glucanase activity while 
no activity was detected in non-transgenic seed (positive signal represents background noise 
associated with this assay). 

Table 1 1 shows activity of the thermophilic beta-glucanase/mannanase 6GP1 in 
transgenic corn seed. Transgenic seed for pNOV4820 (events 1-6) and pNOV4823 (events 7-9) 
were analysed for 6GP1 beta-glucanase activity using the azo-barley glucan method 
(megazyme). Enzymatic assays were conducted at 50 degrees C and the results indicate that the 
transgenic seed have thermophilic 6GP1 beta-glucanase activity while no activity is detected in 
non-transgenic seed. 



Table 1 1 

Seed Abs 590 

Wild Type 0 

1 0.21 

2 0.31 

3 0.36 

4 0.23 

5 0.16 

6 0.14 

7 0.52 

8 0.54 

9 0.49 



Example 46 
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Expression of the mesophilic barley Amvl amylase in corn 

A variety of constructs were generated for the expression of the barley Amyl alpha- 
amylase in corn. The maize y-zein and globulin promoters were used to express the amylase 
specifically in the endosperm or embryo, respectively. In addition, the maize y-zein signal 
sequence and a synthetic ER retention signal were used to regulate the subcellular localization of 
the amylase protein. All 5 constructs (pNOV4867, pNOV4879, pNOV4897, pNOV4895, 
pNOV4901) yielded transgenic plants with alpha-amylase activity detected in the seed. Table 12 
shows the activity in individual seed for 5 independent, segregating events (constructs 
pNOV4879 and pNOV4897). All of the constructs produced some transgenic events with a 
shrivelled seed phenotype indicating that synthesis of the barley Amyl amylase could effect 
starch formation, accumulation, or breakdown. 

Table 12 shows activity of the barley Amyl alpha-amylase in individual corn seed 
(constructs pNOV4879 and pNOV4897). Individual, segregating seed for constructs pNOV4879 
(seed samples 1 and 2) and pNOV4897 (seed samples 3-5) were analysed for alpha-amylase 
activity as described previously. 



Table 12 



Seed 


U/g corn flour 


1A 


19.29 


IB 


1.49 


1C 


18.36 


ID 


1.15 


IE 


1.62 


IF 


14.99 


1G 


1.88 


1H 


1.83 


2A 


2.05 


2B 


36.79 
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2C 
2D 
2E 
2F 
2G 
2H 
3A 
3B 
3C 
3D 
3E 
3F 
3G 
3H 
4A 
4B 
4C 
4D 
4E 
4F 
4G 
4H 
5A 
5B 
5C 
5D 
5E 
5F 
5G 
5H 



30.11 

2.25 

32.37 

1.92 

20.24 

35.76 

22.99 

1.72 

25.38 

18.41 

28.51 

2.11 

16.67 

1.89 

1.57 

36.14 

23.35 

1.70 

1.94 

14.38 

2.09 

I. 83 

II. 64 
18.20 
1.87 
2.07 
1.71 
1.92 
12.94 
15.25 



Example 47 
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Preparation of Xylanase Constructs 

Table 13 lists 9 binary vectors that each contain a unique xylanase expression cassette. The 
xylanase expression cassettes include a promoter, a synthetic xylanase gene (coding 
sequence), an intron (PEPC, inverted), and a terminator (35S). 

Two synthetic maize-optimized endo-xylanase genes were cloned into binary vector 
pNOV21 17. These two xylanase genes were designated BD7436 (SEQ ID NO: 61) and 
BD6002A (SEQ ED NO:63). Additional binary vectors containing a third maize-optimized 
sequence, BD6002B (SEQ ID NO:65) can be made. 

Two promoters were used: the maize gIutelin-2 promoter (27-kD gamma-zein promoter 
(SEQ ID NO: 12 ) and the rice glutelin-1 (Osgtl) promoter (SEQ ID NO: 67). The first 6 
vectors listed in Table 1 have been used to generate transgenic plants. The last 3 vectors can 
also be made and used to generate transgenic plants. 

Vector 1 1560 and 1 1562 encode the polypeptide shown in SEQ ED NO: 62 (BD7436). 
Constructs 1 1559 and 1 1561 encode a polypeptide consisting of SEQ ID NO: 17 fused to the N- 
terminus of SEQ ED NO: 62. SEQ ID NO: 17 is the 19 amino acid signal sequence from the 27- 
kD gamma-zein protein. 

Vector 12175 encodes the polypeptide shown in SEQ ID NO: 64(BD6002A). Vector 
12174 encodes a fusion protein consisting of the gamma-zein signal sequence (SEQ ED NO: 17) 
fused to the N-terminus of SEQ ED NO: 64. 

Vectors pWIN062 and pWIN064 encode the polypeptide shown in SEQ ED NO: 
66(BD6002B). Vector pWIN058 encodes a fusion protein consisting of the chloroplast transit 
peptide of maize waxy protein (SEQ ED NO: 68) fused to the N-terminus of SEQ ED NO: 66 . 
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Table 13 Xylanase binary vectors 



Vector 


rromoter 


cirrnal ^pniiptire Source 


Xvlanase Gene 


11559 


27kD Garnma-zein 


27kD Gamma-zein 


BD7436 


11560 


27kD Gamma-zein 


None 


BD7436 


11561 


OsGtl 


27kD Gamma-zein 


BD7436 


11562 


OsGtl 


None 


BD7436 


12174 


27kD Gamma-zein 


27kD Gamma-zein 


BD6002A 


12175 


27kD Gamma-zein 


None 


BD6002A 


PWIN058 


27kD Gamma-zein 


Maize waxy protein 


BD6002B 


PWIN062 


OsGtl 


None 


BD6002B 


PWIN064 


27kD Gamma-zein 


None 


BD6002B 



All constructs include an expression cassette for PMI, to allow positive selection of 
regenerated transgenic tissue on mannose-containing media. 

Example 48 

Xvlanase Activity Assay Results 

The data shown in Tables 14 and 15 demonstrate that xylanase activity accumulates in Tl 
generation seed harvested from regenerated (TO) maize plants stably transformed with binary 
vectors containing xylanase genes BD7436 (SEQ ID NO: 61 in Example 47) and BD6002A 
(SEQ ID NO:63 in Example 47). Using an Azo-WAXY assay (Megazyme), activity was 
detected in extracts from both pooled (segregating) transgenic seed and single transgenic seed. 

Tl seed were pulverized and soluble proteins were extracted from flour samples using 
citrate-phosphate buffer (pH 5.4). Flour suspensions were stirred at room temperature for 60 
minutes, and insoluble material was removed by centrifiigation. The xylanase activity of the 
supernatant fraction was measured using the Azo-WAXY assay (McCleary, B.V. "Problems in 
the measurement of beta-xylanase, beta-glucanase and alpha-amylase in feed enzymes and 
animal feeds". In proceedings of Second European Symposium on Feed Enzymes" (W.van 
Hartingsveldt, M. Hessing, J.P. van der Jugt, and W. A.C Somers Eds.), Noordwiijkerhout, 
Netherlands, 25-27 October, 1995). Extracts and substrate were pre-incubated at 37°C. To 1 
volume of IX extract supernatant, 1 volume of substrate (1% Azo-Wheat Arabinoxylan S- 
AWAXP) was added and then incubated at 37°C for 5 minutes. Xylanase activity in the corn 
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flour extract depolymerizes the Azo- Wheat Arabinoxylan by an endo-mechanism and produces 
low molecular weight dyed fragments in the form of xylo-oligomers. After the 5 minute 
incubation, the reaction was terminated by the addition of 5 volumes of 95% EtOH. Addition of 
alcohol causes the non-depolymerized dyed substrate to precipitate so that only the lower 
molecular weight xylo-oligomers remain in solution. Insoluble material was removed by 
centrifugation. The absorbance of the supernatant fraction was measured at 590nm, and the units 
of xylanase per gram of flour were determined by comparison to the absorbance values from 
identical assays using a xylanase standard of known activity. The activity of this standard was 
determined by a BCA assay. The enzyme activity of the standard was determined using wheat 
arabinoxylan as substrate and measuring the release of reducing ends by reaction of the reducing 
ends with 2,2'-bicinchoninic acid (BCA). The substrate was prepared as a 1.4% w/w solution of 
wheat arabinoxylan (Megazyme P-WAXYM) in 100 mM sodium acetate buffer pH5.30 
containing 0.02% sodium azide. The BCA reagent was prepared by combining 50 parts reagent 
A with 1 part reagent B (reagents A and B were from Pierce, product numbers 23223 and 23224, 
respectively). These reagents were combined no more than four hours before use. The assay 
was performed by combining 200 microliters of substrate to 80 microliters of enzyme sample. 
After incubation at the desired temperature for the desired length of time, 2.80 milliliters of BCA 
reagent was added. The contents were mixed and placed at 80°C for 30-45 minutes. The 
contents were allowed to cool and then transferred to cuvettes and the absorbance at 560nm was 
measured relative to known concentrations of xylose. The choice of enzyme dilution, incubation 
time, and incubation temperature could be varied by one skilled in the art. 

The experimental results shown in Table 14 demonstrate the presence of recombinant 
xylanase activity in flour prepared from Tl generation corn seed. Seed from 12 TO plants 
(derived from independent T-DNA integration events) were analyzed. The 12 transgenic events 
were derived from 6 different vectors as indicated (refer to Table 13 in Example 47 for 
description of vectors). Extracts of non-transgenic (negative control) corn flour do not contain 
measurable xylanase activity (see Table 15). The xylanase activity in these 12 samples ranged 
from 10-87 units/gram of flour. 
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Table 14. Analysis of pooled Tl seed. 



Vector 


Sample 


Xylanase Units / Gram of Flour 


11559 


MD9L013800 


63 


11559 


MD9L012428 


58 


11560 


MD9L011296 


33 


11560 


MD9L011322 


21 


11561 


MD9L012413 


87 


1 1561 


MD9L012443 


oo 


11562 


MD9L012890 


13 


11562 


MD9L013788 


12 


12174 


MD9L022080 


16 


12174 


MD9L022195 


10 


12175 


MD9L022061 


74 


12175 


MD9L022134 


69 



The results in Table 15 demonstrate the presence of xylanase activity in corn flour 
derived from single kernels. Tl seed from two TO plants containing vectors 1 1561 and 1 1559 
were analyzed. These vectors are described in Example 47. Eight seed from each of the two 
plants were pulverized and flour samples from each seed were extracted. The table shows results 
of single assays of each extract. No xylanase activity was found in assays of extracts of seeds 1, 
5, and 8 for both transgenic events. These seed represent null segregants. Seed 2, 3, 4, 6, and 7 
for both transgenic events accumulated measurable xylanase activity attributable to expression of 
the recombinant BD7436 gene. All 10 seed that tested positive for xylanase activity (>10 
unit/gram flour) had an obvious shriveled or shrunken appearance. By contrast the 6 seed that 
tested negative for xylanase activity (<1 unit/gram flour) had a normal appearance. This result 
suggests that the recombinant xylanase depolymerized endogenous (arabino)xylan substrate 
during seed development and/or maturation. 



Table 15. Analysis of single Tl seed. 



Vector 11561 



Vector 11559 
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Seed 


Xylanase Units / 


Seed 


Xylanase Units / 


Number 


Gram of Flour 


Number 


Gram of Flour 


1 


0 


1 


1 


2 


45 


2 


52 


3 


38 


3 


21 


4 


40 


4 


13 


5 


0 


5 


0 


6 


40 


6 


28 


7 


32 


7 


23 


8 


0 


8 


0 



Example 49 

Enhanced starch recovery from corn seed using enzymes 

Corn wet-milling includes the steps of steeping the corn kernel, grinding the corn kernel, 
and separating the components of the kernel. A bench top assay (the Cracked Corn Assay) was 
developed to mimic the corn wet-milling process 

The "Cracked Corn Assay" was used for identifying enzymes that enhance starch yield 
from maize seed resulting in an improved efficiency of the corn wet milling process. Enzyme 
delivery was either by exogenous addition, transgenic corn seed, or a combination of both. In 
addition to the use of enzymes to facilitate separation of the corn components, elimination of S0 2 
from the process is also shown. 

Cracked Corn Assay. 

One gram of seed was steeped overnight in 4000, 2000, 1000, 500, 400, 40, or 0 ppm 
S0 2 at 50 degrees C or 37 degrees C. Seeds were cut in half and the germ removed. Each half 
seed was cut in half again. Steep water from each steeped seed sample was retained and diluted 
to a final concentrations ranging from 400 ppm to 0 ppm S0 2 . Two milliliters of the steep water 
with or without enzymes was added to the de-germed seeds and the samples placed at 50 
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degrees C or 37degrees C for 2-3 hours. Each enzyme was added at 10 units per sample. All 
samples were vortex ed approximately every 15 minutes. After 2-3 hours the samples were 
filtered through mira cloth into a 50ml centrifuge tube. The seeds were washed with 2 ml of 
water and the sample pooled with the first supernatant. The samples were centrifuged for 15 
minutes at 3000 rpm. Following centrifugation, the supernatant was poured off and the pellet 
placed at 37 degrees C to dry. All pellet weights were recorded. Starch and protein 
determinations ware also carried out on samples for determining the starch rprotein ratios released 
during the treatments (data not shown). 



Anaylsis of Tl and T2 seed from maize plants expressing 6GP1 endoglucanase in 
Cracked corn Assay 

Transgenic corn (pNOV4819 and pNOV4823) containing a thermostable endoglucanse 
performed well when analyzed in the Cracked Corn Assay. Recovery of starch from the 
pNOV4819 line was found to be 2 fold higher in seeds expressing the endoglucanase when 
steeped in 2000 ppm SO2. Addition of a protease and cellobiohydrolase to the endoglucanse 
seed increased the starch recovery approximately 7 fold over control seeds. See Table 16. 



Table 16. Crack Corn Assay results for cytosolic expressed 
Endoglucanase (pNOV4820). Control line, A188/HiII 
PNOV4819 lines, 42C6A-1-21 and 27. 







^^^9 


A188/HiII Control 


No Enzyme 


28.4 


A188/HiII Control 


Bromelain/C8546 10U 


109.3 








42C6A-1-21 


No Enzyme 


52.6 


42C6A-1-21 


Bromelain/C8546 10U 


170.4 








42C6A-1-27 


No Enzyme 


60.5 


42C6A-1-27 


Bromelain/C8546 10U 


207.5 



Similar results 



were seen in transgenic 
seed containing 
endoglucanase targeted 



to the ERof the 
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endosperm (pNOV4823), again resulting in a 2 -7 fold increase in starch recovery when 
compared to control seed. See Table 17. 



Table 1 7. Crack Corn Assay results for ER expressed 



endoglucanase (pNOV4823). Control line, A188/HiII; 

PNOV4823 line, 101D1 1 A-l-28. 



Line 


Treatment 


Starch 
Pellet Wt 
(mg) 


Starch 
Pellet Wt 
(mg) 


Mean Wt. 


A188/HHI 


No Enzyme 


22.5 


19.1 


20.8 


101D11A-1-28 


No Enzyme 


41.2 


32 


36.6 












A188/HHI 


10U Bromelian/C8546 


78.6 


73.8 


76.2 


101D11A-1-28 


10U Brome!ian/C8546 


169.8 


132.6 


151.2 



These results confirm that expression of an endoglucanase enhances the separation of starch and 
protein components of the corn seed. Further more it could be shown that reduction or removal 
of S02 during the steeping process resulted in starch recovery that was comparable to or better 
than normally steeped control seeds. See Table 18. Removal of high levels of S02 from the 
wet-milling process can provide value-added benefits. 

Table 18. Comparison of various concentrations of S02 on starch recovery from transgenic 



6GP1 seed. 



Line 


Treatment 


Starch Pellet Wt 
(mg) 


A188 Control 


2000 ppm S02 


18.5 


JHAF Control 


2000 ppm S02 


29.1 


42C (pNOV4820) 


2000 ppm S02 


29.5 


101C (pNOV4823) 


2000 ppm S02 


73.1 


101D (pNOV4823) 


2000 ppm S02 


42.5 


136A (pNOV4825) 


2000 ppm SQ2 36.6 
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137A (pNOV4825) 


2000 ppm S02 


38.6 








42C (pNOV4820) 


400 ppm S02 


18.5 


101C (pNOV4823) 


400 ppm S02 


20.4 


101D(pNOV4823) 


400 ppm S02 


39.7 


136A (pNOV4825) 


400 ppm S02 


26 


137A (pNOV4825) 


400 ppm S02 


26.9 








42C (pNOV4820) 


0 ppm S02 


21.9 


101C (pNOV4823) 


0 ppm S02 


32.5 


101D (pNOV4823) 


0 ppm S02 


39 


136A (pNOV4825) 


0 ppm S02 


17.8 


137A (pNOV4825) 


0 ppm S02 


29.2 



Example 50 

Construction of transformation vectors for maize optimized bromelain 

Expression cassettes were constructed to express the maize optimized bromelain in maize 
endosperm with various targeting signals as follows: 

pSYNl 1000 (SEQ ID NO. 73 ) comprises the bromelain signal sequence 
(MAWKVQVVFLFLFLCVM WASPS AAS A ) (SEQ ID NO: 72) and synthetic bromelain 
sequence fused with a C-terminal addition of the sequence VFAEAIAANSTLVAE for targeting 
to and retention in the PVS (Vitale and Raikhel Trends in Plant Science Vol 4 no.4 pg 149-155). 
The fusion was cloned behind the maize gamma zein promoter for expression specifically in the 
endosperm. 

pSYNl 1587 (SEQ ID NO:75) comprises the bromelain N-terminal signal sequence 
(MA WKVQ VVFLFLFLC VMW ASPS AAS A ) and synthetic bromelain sequence with a C- 
terminal addition of the sequence SEKDEL for targeting to and retention in the endoplasmic 
reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the maize gamma 
zein promoter, for expression specifically in the endosperm. 

pSYNl 1589 (SEQ ED NO. 74) comprises the bromelain signal sequence 
(MA WKVQ VVFLFLFLC VMW ASPS AAS A) (SEQ ID NO: 72) fused to the lytic vacuolar 
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targeting sequence SSSSFADSNPERVTDRAAST (Neuhaus and Rogers Plant Molecular 
Biology 38:127-144, 1998) and synthetic bromelain for targeting to the lytic vacuole. The 
fusion was cloned behind the maize gamma zein prmoter for expression specifically in the 
endosperm. 

pSYN12169 (SEQ ED NO: 76) comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to the synthetic bromelain for targeting 
to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The fusion 
was cloned behind the maize gamma zein promoter for expression specifically in the endosperm. 

pSYN12575 (SEQ ID NO: 77) comprises the waxy amyloplast targeting peptide (Klosgen 
et al., 1986) fused to the synthetic bromelain for targeting to the amyloplast. The fusion was 
cloned behind the gamma zein promoter for expression specifically in the endosperm. 

pSM270 ( SEQ ED NO.78 ) comprises the bromelain N-terminal signal sequence fused 
to the lytic vacuolar targeting sequence SSSSFADSNPERVTDRAAST (Neuhaus and Rogers 
Plant Molecular Biology 38:127-144, 1998) and synthetic bromelain for targeting to the lytic 
vacuole. The fusion was cloned behind the aleurone specific promoter P 19 (US Patent 6392123) 
for expression specifically in the aleurone. 

Example 51 

Expression of bromelain in corn 

Seeds from Tl transgenic lines transformed with vectors containing the synthetic 
bromelain gene with targeting sequences for expression in various subcellular location of the 
seed were analyzed for protease activity. Corn-flour was made by grinding seeds, for 30 sec, in 
the Kleco grinder. The enzyme was extracted from 100 mg of flour with 1 ml of 50 mM NaOAc 
pH4.8 or 50 mM Tris pH 7.0 buffer containing ImM EDTA and 5 mM DTT. Samples were 
vortexed, then placed at 4C with continuous shaking for 30 min. Extracts from each transgenic 
line was assayed using resorufin labeled casein (Roche, Cat. No. 1 080 733) as outlined in the 
product brochure. Flour from T2 seeds were assayed using a bromelain specific assay as 
outlined in Methods in Enzymology Vol. 244: Pg 557-558 with the following modifications. 
lOOmg of corn seed flour was extracted with 1ml of 50mMNa 2 HPO4/50mM NaH 2 P0 4 , pH 7.0, 1 
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mM EDTA +/- IfxM leupeptin for 15 min at 4°C. Extracts were centrifiiged for 5 min at 14,000 
rpm at 4°C. Extracts were done in duplicates. .Flour from T2 Transgenic lines was assayed for 
bromelain activity using Z-Arg-Arg-NHMec (Sigma) as a substrate. Four aliquots of 100/d /corn 
seed extracts were added to 96 well flat bottom plates (Corning) containing 50/il 
100mMNa 2 HPO4/100mM NaH 2 P0 4 , pH 7.0, 2mM EDTA, 8mM DTT/well. The reaction was 
started by the addition of 50/xl of 20/xM Z-Arg-Arg-NHMec. The reaction rate was monitor 
using a SpectraFluorPlus(Tecan) fitted with a 360nm excitation and 465nm emission filters at 
40°C at 2.5min intervals. 

Table 19 shows the analysis of seed from different Tl bromelain events. Bromelain 

expression was found to be 2-7 fold higher than the A 188 and JHAF control lines. Tl 

transgenic lines were replanted and T2 seeds obtained. Analysis of T2 seeds showed 

expression of bromelain. Figure 21 shows bromelain activity assay using Z-Arg-Arg-NHMec_in 

T2 seed for ER targeted (11 587) and lytic vacuolar targeted (1 1589) bromelain. 

Analysis of T2 seed from maize plants expressing Bromelain 

Seed from T2 transgenic bromelain line, 1 1587-2 was analyzed in the Cracked Corn 

assay for enhanced starch recovery. Previous experiments using exogenously added bromelain 

showed an increased starch recovery when tested alone and in combination with other enzymes, 

particularly cellulases. The T2 seed from line 1 1587-2 showed a 1.3 fold increase in starch 

recovered over control seed when steeped at 37C/2000 ppm S02 overnight. More importantly, 

there was the 2 fold increase in starch from the T2 bromelain line, 1 1587-2 when a cellulase 

(C8546) was added when seeds were steeped at 37C/2000 ppm S02. 

The transgenic line showed a similar trend in increased starch over control seed when 

seeds were steeped at 37C/400 ppm S02. A 1.6 fold increase starch recovered over control was 
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seen in the transgenic seed and a 2.1 fold increase of starch with addition of a cellulase (C8546). 
See Table 20. 

These results are significant in showing that it is possible to reduced temperature and 
S02 levels while also enhancing the starch recovery during the wet-milling process when 
transgenic seed expressing a bromelain is used. 



Table 19 



Summary of Grain Specific Expression of Bromelain in Tl corn. 



Line Number 


Targeting 


Construct 


"Specific Activity" ng 
Bromelain/protein 


11000-1 


Vacuolar 


GZP/probromelain/barleyPVS 


252 


11000-2 


Vacuolar 


GZP/probromelain/barleyPVS 


277 


1 1 000-3 


Vacuolar 


GZP/probromelain/barleyPVS 


284 


11587-1 


ER 


GZP/probromelain/KDEL 


174 


11587-1 


ER 


GZP/probromelain/KDEL 


153 


11589-1 


Lytic Vacuolar 


GZP/aleurainSS/probromelain 


547 


1 1 589-2 


Lytic Vacuolar 


GZP/aleurainSS/probromelain 


223 






A188 Control 


56 






JHAF Control 


75 



Table 20 Cracked Corn Assay results for T2 Bromelain seed 



Steep Conditions 


Line : 


Starch Pellet Wt. (mg) 


2000 ppm S02 


A188 


41.3 


2000 ppm S02 


A188/C8546 (10 units) 


44 


2000 ppm S02 


1 1 587-2 


57.4 


2000 ppm S02 


1 1 587-2/C8546 (10 units) 


94.6 


400 ppm 


A188 


30.7 


400 ppm 


A188/C8546 (10 units) 


35.8 


400 ppm 


11587-2 


50.5 


400 ppm 


1 1 587-2/C8546 (10 units) 


86.6 
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Example 52 

Construction of transformation vectors for maize optimized ferulic acid esterase. 
Expression cassettes were constructed to express the maize optimize ferulic acid esterase 

in maize endosperm with or without various targeting signals as follows: 

Plasmid 13036 (SEQ ID NO: 101) comprises the maize optimize ferulic acid esterase 

(FAE) sequence (SEQ ID NO: 99). The sequence was cloned behind the maize gamma zein 

promoter without any targeting sequences for expression specifically in the cytosol of the 

endosperm. 

Plasmid 13038 (SEQ ID NO: 103) comprises the maize y-zein N-terminal signal 
sequence (MRVLLVALALLALAASATSXSEQ ID NO: 17) fused to the synthetic FAE for 
targeting to the endoplasmic reticulum and secretion into the apoplast (Torrent et al. 1997). The 
fusion was cloned behind the maize gamma zein promoter for expression specifically in the 
endosperm. 

Plasmid 13039 (SEQ ID NO: 105) comprises the waxy amyloplast targeting peptide 
(MLAALATSQLVATRAGLGVPDASTFRRGAAQGLRGARASAAAD 
TLSMRTSARAAPRHQHQQARRGARFPSLVVCASAGA) (Klosgen et al., 1986) fused to the 
synthetic FAE for targeting to the amyloplast. The fusion was cloned behind the gamma zein 
promoter for expression specifically in the endosperm. 

Plasmid 13347 (SEQ ID NO: 107) comprises the maize y-zein N-terminal signal 
sequence (MRVLLVALALLALAASATSXSEQ ID NO: 17) fused to the synthetic FAE sequence 
with a C-terminal addition of the sequence SEKDEL for targeting to and retention in the 
endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned behind the 
maize gamma zein promoter_for expression specifically in the endosperm. 

All expression cassettes were moved into a binary vector pNOV21 17 for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 
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Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable cotransformation. 



Synthetic Ferulic Acid Esterase Sequence (SEP ID NO: 99) 

atggccgcctccctccceaccatgcceccgtccggctacgaccaggtgcgcaacggcgtgcc2Cgcggccaggtggtgaacatctcctacttctccaccs ccaccaa 

ctccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaagaagtactccgtgctctacctcctccacggcatcggcggctccgagaacfiactgRtt 

cgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgagggcaagatcaagc cgctcatcatcgtgaccccgaacaccaacRCCRccggcccgg 

gcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatcccgtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatCRC 

cggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctcgacaagttcgcctacatcggcccgatctccgccgccccgaacacctacccsaacga 

gcgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctcttcatcgcctgcggcaccaac gactccctcatcggcttcggccagcgcgtgcacg 

agtactgcgtggccaacaacatcaaccacgtgtactggctcatccagggcggcg g ccacgacttcaacgtgtggaagccgggcctctggaacttcctccagatggccg 

acgaggccggcctcacccgcgacggcaacaccccggtgccgaccccgtccccgaagccggcca acacccgcatcgaggccgaggactacgacggcatcaactcc 

tcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctacatcacctccg g cgactacctcgtgtacaagtccatcgacttcggcaacggcgcc 

acctccttcaaggccaaggtggccaacgccaacacctccaacatcgagcrtcgcctcaacggcc cgaacggcaccctcatcggcaccctctccgtgaagtccaccggc 

gactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatcaacgacctctacctcgtgttcaagggcccggtgaacatcgactggttcacc ttcg 

gcgtgtag 

Synthetic Ferulic Acid Esterase Amino Acid Sequence (SEP ID NO: 100) 

maaslprmppsgvdqvrngvprgqvvmsvfstam^ 

gjadgyenftkdUnsl^^ 

vannirJiyywliqggghdfnvwkpglvvTinqmadeagln-dgntpvpmsplcp 
sikakvanantsruelrlngpngtligtlsvkstgdwnWeeqtcsiskvtgin dlvlvfkgpvrudwr^^ 



13036 Sequence (SEP ED NO: 101) 

atggccgcctccctcccgaccatgccgccgtccggctacgaccaggtgcgcaacggcgtgccgcgcggccaggtggtgaacatctcctacttctccacc gccaccaa 

ctccacccgcccggcccgcgtgtacctcccgccgggctactccaaggacaagaagtactccg tgctctacctcctccacggcatcggcggctccgagaacgactggtt 

cgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgagggcaagatcaag ccgctcatcatcgtgaccccgaacaccaacgccgccggcccgg 

gcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatcccgtacatcgag tccaactactccgtgtacaccgaccgcgagcaccgcgccatcgc 

cggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctcgacaagttcgcctacatcggccc gatctccgccgccccgaacacctacccgaacga 

gcgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctcttcatcgcctg cggcaccaacgactccctcatcggcttcggccagcgcgtgcacg 

agtactgcgtggccaacaacatcaaccacgtgtactggctcatccagggcggcggcc acgacttcaacgtgtggaagccgggcctctggaacttcctccagatggccg 

acgaggccggcctcacccgcgacggcaacaccccggtgccgaccccgtcccc g aagccggccaacacccgcatcgaggccgaggactacgacggcatcaactcc 

tcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctacatcacctccg g cgactacctcgtgtacaagtccatcgacttcggcaacggcgcc 

acctccttcaaggccaaggtggccaacgccaacacctccaacatcgagcttcgcctcaacggcccgaacggca ccctcatcggcaccctctccgtgaagtccaccggc 

gactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatcaacg acctctacctcgtgttcaagggcccggtgaacatcgactggttcaccttcg 

gcgtgtag 

13036 AA Sequence (SEP ID NO: 102) 

maaslprmppsgvdqvrngvprgq\^svfo^ 

giadgyenftkdnnslipviesnvsvvtdrehraiaglsm 

vaniunh\nAvliqggghdfnvwkiDgl\ynflqmadeagltrdgntpv^m 

sikakyanantsnielrlngpngtligtlsvkstgdvyntveeqtcsiskytgindlvlvikgpvm 

13038 Sequence (SEP IP NP:103) 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccacctccatggccgcc tccctcccgaccatgccgccgtccggctacgaccaggtgcgca 
acggcgtgccgcgcggccaggtggtgaacatctcctacttctccaccgccaccaactccacccgccc ggcccgcgtgtacctcccgccgggctactccaaggacaag 
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aagtactccgtpctctacctcctccacggcatcggcggctccgagaacgactggttcgagee cggcggccgcgccaacgtgatcgccgacaacctcatcgccgaggg 

caagatcaagccgctcatcatcgtgaccccgaacaccaacgccgccggcccgggcatc gccgacggctacgagaacttcaccaaggacctcctcaactccctcatccc 

gtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatcgccggcctctctatggg cggcggccagtccttcaacatcggcctcaccaacctcgac 

aagttcgcctacatcggcccgatctccgccgccccgaacacctacccgaacga gcgcctcttcccggacggcggcaaggccgcccgcgagaagctcaagctcctctt 

catcgcctgcggcaccaacgactccctcatcggcttcggccagcgcgtgcacgagtactgcgtggccaacaac atcaaccacgtgtactggctcatccagggcggcgg 

ccacgacttcaacgtgtggaagccgggcctctggaacttcctccagatggccgacgaggccggcctcacccgcgacggcaacaccccggtgccgaccccgtccccR 

aagccggccaacacccgcatcgaggccgaggactacgacggcatcaactcctcctccatcgagatcatcggc gtgccgccggagggcggccgcggcatcggctac 

atcacctccggcgactacctcgtgtacaagtccatcgacttcggcaacggcgccacctccttcaaggccaaggt ggccaacgccaacacctccaacatcgagcttcgcc 

tcaacggcccgaacggcaccctcatcggcaccctctccgtgaagtccaccggcgactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatc 

aacgacctctacctcgtgttcaagggcccggtgaacatcgactggttcacc ttcggcgtgtag 

13038 AA Sequence (SEP ID NO: 104) 

mrvllvalallalaasatsmaaslptmppsgvdqvmgvprgqvvm^ 
gkikpliivmntnaagpgiadgvenftkdllnslipviesnvsvvtdrehraiaglsm 
cgtridsligfgqwhevcvanninhvvwliqggghdfnvwkpglwnflqmadeagltrd 
tsgdvlvvksidfgngatsflcakvanantsnielrlngpngtligtlsvkstgdwntveeqtcsiskvtgindlvlv^ 

13039 Sequence (SEP ID NO: 105) 

atgctggcggctctggccacgtcgcagctcgtcgcaacgcgcgccggcctgggcgtcccggacgcgtcca cgttccgccgcggcgccgcgcagggcctgagggg 

ggcccgggcgtcggcggcggcggacacgctcagcatgcggaccagcgcgcgcgcggcgc ccaggcaccagcaccagcaggcgcgccgcggggccaggttcc 

cgtcgctcgtcgtgtgcgccagcgccggcgccatggccgcctccctcccgaccatgccgccgtccggctacgaccaggtgcgcaacggcgtgccgcgcggccaggt 

ggtgaacatctcctacttctccaccgccaccaactccacccgcccggcccgcgtgtacctcccgccgggctac tccaaggacaagaagtactccgtgctctacctcctcc 

acggcatcggcggctccgagaacgactggttcgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgagggcaagatcaagccgctcatcatcgt 

gaccccgaacaccaacgccgccggcccgggcatcgccgacggctacgagaacttcaccaaggacctcctcaactccctcatcccgtacatcga gtccaactactccgt 

gtacaccgaccgcgagcaccgcgccatcgccggcctctctatgggcggcggccagtccttcaacatcggcctcaccaacctcgaca agrtcgcctacatcggcccgat 

ctccgccgccccgaacacctacccgaacgagcgcctcttcccggacggcggcaaggccgcccgcgagaa gctcaagctcctcttcatcgcctgcggcaccaacgact 

ccctcatcggcttcggccagcgcgtgcacgagtactgcgtggccaacaacatcaaccacgtgtactggctca tccagggcggcggccacgacttcaacgtgtggaagc 

cgggcctctggaacttcctccagatggccgacgaggccggcctcacccgcgacggcaacaccc cggtgccgaccccgtccccgaagccggccaacacccgcatcg 

aggccgaggactacgacggcatcaactcctcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctacatcacctccggcgactacctcgtg 

tacaagtccatcgacttcggcaacggcgccacctccttcaaggccaaggtggccaacgccaacacctccaac atcgagcttcgcctcaacggcccgaacggcaccctc 

atcggcaccctctccgtgaagtccaccggcgactggaacacctacgaggagcagacctgctccatctccaaggtgaccggcatcaacgacctctacctcgtgttcaagg 

gcccggtgaacatcgactggttcaccttcggcgtgtag 

13039 AA Sequence (SEP ID NO; 106) 
mlaalatsqlvatraglgvpdastfagaaqglrgarasaaaddsm 
svfstamstroarvvlppgvskdkkvsvlvllhgiggsen 
hraiaglsmgggqsfaigltnldkfavigpisaapntvpn^ 

aflqmadeagltrdgiitpvptpspkpantrieaedvdginsssieiigvppeggTgigvitsgd 
stgdwntveeqtcsiskvtgindlylvfkgpvnidwftfgv* 

13347 Sequence (SEP ID NO: 107) 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccacctccatggccgcctccctcccgac catgccgccgtccggctacgaccaggtgcgca 

acggcgtgccgcgcggccaggtggtgaacatctcctacttctccaccgccaccaactccacccgcccggcccgcgtgt acctcccgccgggctactccaaggacaag 

aagtactccgtgctctacctcctccacggcatcggcggctccgagaacgactggttcgagggcggcggccgcgccaacgtgatcgccgacaacctcatcgccgaggg 

caagatcaagccgctcatcatcgtgaccccgaacaccaacgccgccggcccgggcatcgccgacggctacga gaacttcaccaaggacctcctcaactccctcatccc 

gtacatcgagtccaactactccgtgtacaccgaccgcgagcaccgcgccatcgccggcctctctatgggcggcggccagtccttcaa catcggcctcaccaacctcgac 

aagttcgcctacatcggcccgatctccgccgccccgaacacctacccgaacgagcgcctcttcccggacggc ggcaaggccgcccgcgagaagctcaagctcctctt 

catcgcctgcggcaccaacgactccctcatcggcttcggccaecgcgtgcacgagtactgcgtggccaacaacatcaaccacgtgtactgg ctcatccagggcggcgg 

ccacgacttcaacgtgtggaagccgggcctctggaacttcctccagatggccgacgaggccggcctcacccgcgacggcaa caccccggtgccgaccccgtccccg 

aagccggccaacacccgcatcgaggccgaggactacgacggcatcaactcctcctccatcgagatcatcggcgtgccgccggagggcggccgcggcatcggctac 

atcacctccggcgactacctcgtgtacaagtccatcgacttcggcaacggcgccacctccttcaaggccaaggtg gccaacgccaacacctccaacatcgagcttcgcc 
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tcaacggcccp aac ppcaccctcatcggcaccctctccgtgaagtccaccggc ^actg gaaca cctacgaggagcagacctgctccatctccaaggtgaccggcatc 
aacgacctctac ctcpt gttcaagggcccggtgaacatcgactggttcaccttcggcgt gt c cgagaaR RacRaactctas 

13347 Sequence (SEP ID NO: 108) 

iru^llvalallalaasatsmaaslptmppsgydav^ 
gkikpliivtpntnaagpgiadgvenftkdllnslipvie^ 
cgtndsligfgqrvhevcvarminhvwliQggghdfhvw 
tsgdvlwksidfg n gatsfkakvanantsnielrlngpngU^ 



Example 53 

Hvdrolvtic degradation of corn fiber by ferulic acid esterase 
Corn fiber is a major by-product of corn wet and dry milling. The fiber component is 
composed primarily of course fiber arising from the seed pericarp (hull) and aleurone, with a 
smaller fraction of fine fiber coming from the endosperm cell walls. Ferulic acid, a 
hydroxycinnamic acid, is found in high concentrations in the cell walls of cereal grains resulting 
in a cross linking of lignin, hemicellulose and cellulose components of the cell wall. Enzymatic 
degradation of ferulate cross-linking is an important step in the hydrolysis of corn fiber and may 
result in the accessibility of further enzymatic degradation by other hydrolytic enzymes. 

Ferulic Acid Esterase Activity Assay 

Ferulic acid esterase, FAE-1, ( maize optimised synthetic gene from C. thermocellum) 
was expressed in E. coll Cells were harvested and stored at -80°C overnight. Harvested 
bacteria was suspended in 50mM Tris buffer pH7.5. Lysozyme was added to a final 
concentration of 200 ug/mL and the sample incubated 10 minutes at room temperature with 
gently shaking. The sample was centriftiged at 4 °C for 1 5 minutes at 4000 rpm. Following 
centrifugation, the supernatant was transferred to a 50 mL conical tube, and placed in 70 degree 
Celsius water bath for 30 minutes. The sample was then centriftiged for 15 minutes at 4000 rpm 
and the cleared supernatant transferred to a conical tube ( Blum et al. J Bacteriology, Mar 2000, 
pg 1346-1351.) 
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The recombinant FAE-1 was tested for activity using 4-methylumbelliferyl ferulate as 
described in Mastihubova et al (2002) Analytical Biochemistry 309 96-101. Recombinant 
protein FAE-1 (104-3) was diluted 10, 100, and 1000 fold and assayed. Activity assay results 
are shown in Figure 22. 

Preparation of Corn Seed Fiber 

Corn pericarp coarse fiber was isolated by steeping yellow dent #2 kernels for 48hrs at 50 
°C in 2000 ppm sodium metabisulfite( (Aldrich). Kernels were mixed with water in equal parts 
and blended in a Waring laboratory heavy duty blender with the blade in reverse orientation. 
Blender was controlled with a variable autotransformer (Staco Energy) at 50% voltage output for 
2 min. Blended material was washed with tap water over a standard test sieve #7(Fisher 
scientific) to separate coarse fiber from starch fractions. Coarse fiber and embryos were 
separated by floating the fiber way from the embryos with hot tap water in a 4L beaker (Fisher 
scientific). The fiber was then soaked in ethanol prior to drying overnight in a vacuum oven( 
Precision) at 60° C. Corn coarse fiber derived form corn kernel pericarp was milled with a 
laboratory mill 3100 fitted with a mill feeder 3 1 70(Perten instruments) to 0.5mm particle size. 

Corn Fiber Hydrolysis Assay 

Course fiber (CF) was suspended in 50 mM citrate-phosphate buffer, pH 5.2 at 30 mg/ 5 
ml buffer. The CF stock was vortexed and transferred to a 40 ml modular reservoir (Beckman, 
Cat. No. 372790). The solution was mixed well then 100 ul transferred to a 96 well plate 
(Corning Inc., Cat. No.9017, polystyrene, flat bottom). Enzyme was added at 1-10 ul/well and 
the final volume adjusted to 1 10 ul with buffer. CF background controls contained 10 ul of 
buffer only. Plates were sealed with aluminum foil and incubated at 37°C with constant shaking 
for 18 hours. The plates were centrifuged for 15 min at 4000 rpm. 1-10 ul of CF supernatant 
was transferred to a 96 well plate preloaded with 100 ul of BCA reagents (BCA-reagents: 
Reagent A (Pierce, Prod.# 23223), Reagent B (Pierce, Prod.# 23224). The final volume was 
adjusted to 1 10 ul. The plate was sealed with aluminum foil and placed at 85°C for 30 min. 
Following incubation at 85°C, the plate was centrifuged for 5 min at 2500 rpm. Absorbance 
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values were read at 562 nm (Molecular Devices, Spectramax Plus). Samples were quantified 
with D-glucose and D-xylose (Sigma) calibration curves. Assay results are reported as total 
sugar released. 

Measurement of total sugar released bv Ferulic Acid Esterase in Corn Seed Fiber Hydrolysis 
Assay 

Results from the recombinant FAE-1 fiber hydrolysis assay showed no increase in total 
reducing sugars (data not shown). These results were not unexpected since it has been reported 
in the literature that an increase in total reducing sugars is detectable only when other hydrolytic 
enzymes are used in combination with the FAE ( Yu et al J. Agric. Food Chem. 2003, 51,218- 
223). Figure 23 shows that addition of FAE-2 to a fungal supernatant which had been grown on 
corn fiber, shows and increase in total reducing sugars. This suggests that FAE does play an 
important role in corn fiber hydrolysis. 

Figure 23 shows Com Fiber Hydrolysis assay results showing increase in release of total 
reducing sugars from corn fiber with addition of FAE-2 to fungal supernatant (FS9). 

Analysis of Ferulic Acid released from corn seed fiber bv FAE-1 

FAE activity on corn fiber was tested by following the release of ferulic acid as described 
in Walfron and Parr (1996) ( Waldron. KW. Parr AJ 1996 Vol 7 paaes 305-312 Phytochem 
Anal ) with slight modification. Corn coarse fiber derived from corn kernel pericarp was milled 
with a laboratory mill 3100 fitted with a mill feeder 3170 (Perten instruments) to 0.5mm particle 
size and used as substrate at a concentration of 10 mg/ml. 1 ml assays were conducted in 24 well 
Becton Dickenson Multiwell™. Substrate was incubated in 50 mM citrate phosphate pH 5.4 at 
50° C at 1 10 rpm for 18 hrs in the presence and absence of recombinant FAE. After the 
incubation period, samples were centrifuged for 10 minutes at 13,000 rpm prior to ethyl acetate 
extraction. All solvents and acids used were from Fisher Scientific. 0.8 ml of supernatant was 
acidified with 0.5 ml acetic glacial acid and extracted three times with equivalent volume of 
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ethyl acetate. Organic fractions were combined and speed vac to dryness (Savant) at 40° C. 
Samples were then suspended with lOOjil of methanol and used for HPLC analysis. 

HPLC chromatography was carried out as follows. Ferulic acid (ICN Biomedicals) was 
used as standard in HPLC analysis (data not shown). HPLC analysis was conducted with a 
Hewlett Packard series 1 100 HPLC system. The procedure employed a Ci 8 fully capped reverse 
phase column (XterraRpis, 150mm X 3.9mm i.d. 5/zm particle size) operated in 1 .0 ml min ~ ] at 
40°C. Ferulic Acid was eluted with a gradient of 25 to 70 % B in 32 min (solvent A: H20, 
0.01%b TFA; solvent B: MeCN, 0.0075%). 

As shown in Figure 24, FA released from corn fiber was 2-3 fold higher than control 
when treated with 10 or 100 ul of FAE-1. These results clearly show that FAE-1 is capable of 
hydrolyzing corn fiber. 

Example 54 

Functionality in fermentation of maize expressed glucoamvlase and amylase 

This example demonstrates that maize-expressed enzymes will support fermentation of starch in 
a corn slurry in the absence of added enzyme and without cooking the corn slurry. Maize kernels 
that contain Rhizopus ozyzae glucoamylase (ROGA) (SEQ ID NO: 49) were produced as 
described in Example 32. Maize kernels that contain the barley low-pi a-amylase (AMYI) (SEQ 
ID NO: 88) are produced as described in Example 46. The following materials are used in this 
example: 

Aspergillus niger glucoamylase (ANGA)was purchased from Sigma. 

Rhizopus species glucoamylase (RxGA) was purchased from Wako as a dry crystalline 

powder and made up in 10 mM NaAcetate pH 5.2, 5 mM CaCl 2 . at 10 mg/ml. 
MAMYI Microbially produced AMYI was prepared at approximately 0.25 mg/ml in 10 

mM NaAcetate pH 5.2, 5 mM CaCl 2 . 
Yeast was Saccharomyces cereviceae 
YE was a sterile 5% solution of yeast extract in water 
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Yeast starter contained 50 g maltodextrin, 1.5 g yeast extract, 0.2 mg ZnS0 4 in a total 
volume of 300 ml of water, the medium was sterilized by autoclaving after 
preparation. After cooling to room temperature, 1 ml of tetracycline (10 mg/ml 
in ethanol), 100 p.1 AMG300 glucoamylase and 155 mg active dry yeast, were 
added. The mixture was then shaken at 30 °C for 22 h. The overnight yeast 
culture was diluted 1/10 with water and A600 measured to determine the yeast 
number, as described in Current Protocols in Molecular Biology. 

ROGA flour Kernels were pooled from several TO lines shown to have active 

glucoamylase The seeds were ground in the Kleco, and all flour was pooled . 

AMYI flour Kernels from TO corn expressing AMYI were pooled and ground as above. 

Control flour Kernels from with similar genetic background were ground in the same 
fashion as the ROGA expressing corn 

An inoculation mixture was prepared in a sterile tube; it contained per 1 .65 ml: yeast cells (lx 
10 7 ), yeast extract (8.6 mg), tetracycline (55 \ig). 1.65 ml was added / g flour to each 
fermentation tube. 

Fermentation preparation: Flour was weighed out at 1 .8 g / tube into tared 17 x 100 mm 
sterile polypropylene. 50 |al of 0.9 M H 2 S0 4 was added to bring the final pH prior to 
fermentation to 5. The inoculation mixture (2.1 ml) was added / tube, along with 
RXGA, AMYI-P and amylase desalting buffer as indicated below. The quantity of 
buffer was adjusted based on moisture content of each flour so that the total solids 
content was constant in each tube. The tubes were mixed throroughly, weighed and 
placed into a plastic bag and incubated at 30 °C. 



Table 21 





Flours 


Innoculation 


Microbial enzymes 


Amylase 
desalting 

Buffer 


Tube 


Control 


ROGA 


AMYI 


Mix 


RXGA 


AMYI-P 




9 


9 


9 


ml 


ml 


ml 


ml 


A 


1.8 






2.1 


0 


0 
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B 


1.8 






1 4 


U.Uob 


U 


4 

1 


O 


1 ft 






2 1 


0 036 


1 


0 


D 


1.8 






2.1 


0 


1 


0.036 


E 


1.6 




0.2 


2.1 


0.036 


0 


1 


F 


0.2 


1.6 




2.1 






1 


G 


0.2 


1.6 




2.1 


0 


1 


0 


H 


0 


1.6 


0.2 


2.1 




0 


1 



The fermentation tubes were weighed at intervals over the 67 h time course. Loss of weight 
corresponds to evolution of C0 2 during fermentation. The ethanol content of the samples was 
determined after 67 h of fermentation by the DCL ethanol assay method. The kit (catalogue # 
229-29) was purchased from Diagnostic Chemicals Limited, Charlottetown, PE, Canada, DIE 
1B0. Samples (10 fxl) were drawn in triplicate from each fermentation tube and diluted into 990 
jal of water. 10 |il of the diluted samples were mixed with 1.25 ml of a 12.5/1 mixture of assay 
buffer / ADH-NAD reagent. Standards (0, 5, 10, 15 & 20% v/v ETOH) were diluted and 
assayed in parallel. Reactions were incubated at 37 °C for 10 min, then A340 read. Standards 
were prepared in duplicate, samples from each fermentation were prepared in triplicate 
(including the initial dilution). The weight of the samples changed with time as detailed in table 
below. The weight loss is expressed as a percentage of the initial sample weight at time 0. 



Table 22 







Time (h) 






0 


18 


24 


42 


48 


67 


Sample 


Flour Composition 


% w 


gt loss 




A 


Control 


0.00 


8.09 


9.38 


12.96 


13.83 


16.85 


B 


Control + RXGA 


0.00 


11.48 


14.20 


21.79 


23.83 


24.63 


C 


Control + RXGA + MAMYI 


0.00 


17.90 


23.27 


36.48 


39.07 


47.59 


D 


Control + MAMYI 


0.00 


13.70 


17.72 


28.27 


30.80 


38.27 


E 


Control +RXGA + AMYI flour 


0.00 


16.85 


21.60 


33.95 


36.98 


45.74 


F 


ROGA flour 


0.00 


9.81 


11.74 


16.96 


18.39 


23.17 


G 


ROGA flour + MAMYI 


0.00 


15.53 


19.69 


29.75 


32.11 


39.94 


H 


ROGA flour + AMYI flour 


0.00 


13.35 


16.27 


23.60 


25.53 


31.68 



These data show that the ROGA enzyme expressed in maize increases fermentation rate as 
compared to the no-enzyme control. It also confirms previous data indicating that the AMYI 
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enzyme expressed in maize kernels is a potent activator of fermentation of the starch in corn. 
The ethanol contents are detailed below. 



Table 23 





Flour 


ETOH 


Standard 


Sample 


Composition 


% v/v 


deviation 


A 


Control 


2.09 


0.08 


B 


Control + RXGA 


7.97 


0.18 


C 


Control + RXGA + MAMYI 


13.47 


0.27 


D 


Control + MAMYI 


11.26 


0.12 


E 


Control +RXGA + AMYI flour 


12.28 


0.08 


F 


ROGA flour 


3.55 


0.05 


G 


ROGA flour + MAMYI 


11.29 


0.18 


H 


ROGA flour + AMYI flour 


8.58 


0.13 



These data also demonstrate that expressing Rhizopus oryzae glucoamylase in maize 
facilitates increased fermentation of the starch in corn. Similarly, expression of the barley 
amylase in maize makes corn starch more fermentable with out adding exogenous enzymes. 



Example 55 

Cellobiohvdrolase I 

The Trichoderma reesei cellobiohydrolase I (CBH I) gene was amplified and cloned by 
RT-PCR based on a published database sequence (accession # E00389). The cDNA sequence 
was analyzed for the presence of a signal sequence using the SignalP program, which predicted a 
1 7 amino acid signal sequence. The DN A sequence encoding the signal sequence was replaced 
with an ATG by PCR, as shown in the sequence (SEQ ID NO: 79). This cDNA sequence was 
used to make subsequent constructs. Additional constructs are made by substituting a maize 
optimised version of the gene (SEQ ED NO: 93). 



Example 56 

Cellobiohvdrolase II 

The Trichoderma reesei cellobiohydrolase II (CBH II) gene was amplified and cloned by 

RT-PCR based on a published database sequence (accession # M55080). The cDNA sequence 

was analyzed for the presence of a signal sequence using the SignalP program, which predicted 
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an 18 amino acid signal sequence. The DNA sequence encoding the signal sequence was 
replaced with an ATG by PCR, as shown in the sequence (SEQ ED NO: 81). This cDNA 
sequence was used to make subsequent constructs. Additional constructs are made by 
substituting a maize optimised version (SEQ ED NO: 94) of the gene. 

Example 57 

Construction of transformation vectors for the Trichoderma reesii cellobiohvdrolase I and 

cellobiohvdrolase II 

Cloning of the Trichoderma reesii cellobiohydrolase I (cWn)cDNA without the native N- 
terminal signal sequence is described in Example 55. Expression cassettes were constructed to 
express the Trichoderma reesii cellobiohydrolase I cDNA in maize endosperm with various 
targeting signals as follows: 

Plasmid 12392 comprises the Trichoderma reesii cbhi cDNA cloned behind the y zein 
promoter for expression specifically in the endosperm for expression in the cytoplasm. 

Plasmid 12391 comprises the maize y-zein N-terminal signal sequence 
(MRVLLVALALLALAASATS)(SEQ ID NO: 17) fused to Trichoderma reesii cbhi cDNA as 
described above in Example 1 for targeting to the endoplasmic reticulum and secretion into the 
apoplast (Torrent et al. 1997). The fusion was cloned behind the y zein promoter for expression 
specifically in the endosperm. 

Plasmidl2392 comprises the y-zein N-terminal signal sequence fused to the Trichoderma 
reesii cbhi cDNA with a C-terminal addition of the sequence KDEL for targeting to and 
retention in the endoplasmic reticulum (ER) (Munro and Pelham, 1987). The fusion was cloned 
behind the maize y zein promoter for expression specifically in the endosperm. 

Plasmid 12656 comprises the waxy amyloplast targeting peptide (Klosgen et al., 1986) 
fused to the Trichoderma reesii cbhi cDNA for targeting to the amyloplast. The fusion was 
cloned behind the maize y zein promoter for expression specifically in the endosperm. 

All expression cassettes were moved into a binary vector (pNOV21 17) for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
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isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

Additional constructs (plasmids 12652,12653,12654 and 12655) were made with the 
targeting signals described above fused to Trichoderma reesii cellobiohydrolasell (cbhii) cDNA 
in precisely the same manner as described for the Trichoderma reesii cbhi cDNA. These ftisions 
were cloned behind the maize Q protein promoter (50Kd y zein) (SEQ ID NO: 98) for expression 
specifically in the endosperm and transformed into maize as described above. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

Combinations of the enzymes can be produced either by crossing plants expressing the 
individual enzymes or by cloning several expression cassettes into the same binary vector to 
enable co-transformation. 

Example 58 

Expression of a Cbhi in corn 

Tl seed from self-pollinated maize plants transformed with either plasmid 12390, 12391 
or 1 2392 was obtained. The 12390 construct targets the expression of the Cbhi in the 
endoplasmic reticulum of the endosperm, the 12391 construct targets the expression of the Cbhi 
in the apoplast of the endosperm and the 12392 construct targets the expression of the Cbhi in 
the cytoplasm of the endosperm. 

Extraction and detection of the Cbhi from corn-flour : Polyclonal antibodies to Cbhi and 
Cbhii were produced in goat according to established protocols. Flour from the Cbhi transgenic 
seeds was obtained by grinding them in an Autogizer grinder. Approximately 50 mg of flour 
was resuspended in 0.5ml of 20mM NaP0 4 buffer (pH 7.4),150mM NaCl followed by 
incubation for 15 minutes at RT with continuous shaking. The incubated mixture was then spun 
for lOmin. at 10,000xg. The supernatant was used as enzyme source. 30 \x\ of this extract was 
loaded on a 4-12 % NuPAGE gel (invitrogen) and separated in the NuPAGE MES running 
buffer (invitrogen). Protein was blotted onto nitrocellulose membranes and Western blot 
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analysis was done following established protocols using the specific antibodies described above 
followed by alkaline phosphatase conjugated rabbit antigoat IgG (H+L) . Alkaline phosphatase 
activity was detected by incubation of the membranes with ready to use BCIP/MBT (plus) 

substrate from Moss Inc. 

Western Blot analysis was done of Tl seeds from different events transformed with 
plasmid 12390. Expression of Cbhl protein was compared to the non-transgenic control, and 
was detected in a number of events. 

The Cracked Corn Assay was performed essentially as described in Example 49, using 
transgenic seed expressing Cbhi. Starch recovery from the transgenic seed was measured and 
the results are set forth in Table 24. 

Table 24. 





Line 3-non expressing controi Line 4- 


CBHI expressing 


Conditions 




Starch (mg) 




400ppm S02-No Bromelain 


40.2 




78.1 


400ppmSO2-Plus Bromelain 


48.1 




118.7 


2000ppm S02-No Bromelain 


47.5 




73.1 


2000ppmSO2-Plus Bromelain 


49.2 




109 



Example 59 

Preparation of Endoglucanase I Constructs 

A Trichoderma reesei endoglucanase I (EGLI) gene was amplified and cloned by PCR 
based on a published database sequence (Accession # Ml 5665; Penttila et al., 1986). Because 
only genomic sequences could be obtained, the cDNA was generated from the genomic sequence 
by removing 2 introns using Overlap PCR. The resulting cDNA sequence was analyzed for the 
presence of a signal sequence using the SignalP program, which predicted a 22 amino acid signal 
sequence. The DNA sequence encoding the signal sequence was replaced with an ATG by PCR, 
as shown in the sequence (SEQ ED NO: 83). This cDNA sequence was used to make subsequent 
constructs as set forth below. 
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Overlap PCR 

Overlap PCR is a technique (Ho et al., 1989) used to fuse complementary ends of two or 
more PCR products, and can be used to make base pair (bp) changes, add bp, or delete bp. At 
the site of the intended bp change, forward and reverse mutagenic primers (Mut-F and Mut-R) 
are made that contain the intended change and 15 bp of sequence on either side of the change. 
For example, to remove an intron, the primers would consist of the final 15 bp of exon 1 fused to 
the first 15 bp of exon 2. Primers are also prepared that anneal to the ends of the sequence to be 
amplified, e.g ATG and STOP codon primers. PCR amplification of the products proceeds with 
the ATG/Mut-R primer pair and the Mut-F/STOP primer pair in independent reactions. The 
products are gel purified and fused together in a PCR without added primers. The fusion 
reaction is separated on a gel, and the band of the correct size is gel purified and cloned. 
Multiple changes can be accomplished simultaneously through the addition of additional 
mutagenic primer pairs. 

EGLI Plant Expression Constructs 
Expression cassettes were made to express the Trichoderma reesei EGLI cDNA in maize 
endosperm as follows: 

13025 comprises the T. reesei EGLI gene cloned behind the maize y-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

13026 comprises the maize y-zein N-terminal signal peptide (MRVLLVALALLALAASATS) 
fused to the T. reesei EGLI gene for targeting to the endoplasmic reticulum and secretion into the 
apoplast. The fusion was cloned behind the maize y-zein promoter for expression specifically in 
the endosperm. 

13027 comprises the maize y-zein N-terminal signal peptide fused to the T. reesei EGLI gene 
with a C-terminal addition of the sequence KDEL for targeting to and retention in the 
endoplasmic reticulum. The fusion was cloned behind the maize y-zein promoter for expression 
specifically in the endosperm. 
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13028 comprises the maize Granule Bound Starch Synthase I (GBSSI) N-terminal signal peptide 
(N-terminal 77 amino acids) fused to the T. reesei EGLI gene for targeting to the lumen of the 
amyloplast. The fusion was cloned behind the maize 7-zein promoter for expression specifically 
in the endosperm. 

13029 comprises the maize GBSSI N-terminal signal peptide fused to the T. reesei EGLI gene 
with a C-terminal addition of the starch binding domain (C-terminal 301 amino acids) of the 
maize GBSSI gene for targeting to the starch granule. The fusion was cloned behind the maize 
7-zein promoter for expression specifically in the endosperm. 

Additional Expression cassettes are generated using a maize optimised version of EGLI 

(SEQ ID NO: 95) 

EGLI Enzyme Assays 

EGLI enzyme activity is measured in maize transgenics using the Malt Beta-Glucanase 
Assay Kit (Cat # K-MBGL) (Megazyme International Ireland Ltd.) The enzymatic activity of 
EGL I expressors is tested in the Corn Fiber Hydrolysis Assay as described in Example 53. 
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Example 60 

fl-Glucosidase 2 

A Trichoderma reesei /3-Glucosidase 2 (BGL2) gene was amplified and cloned by RT- 
PCR based on sequence Accession # AB003 1 1 0 (Takashima et al., 1 999). 

BGL2 Plant Expression Constructs 

Expression cassettes were made to express the Trichoderma reesei BGL2 cDNA (SEQ 
ED NO: 89) in maize endosperm as follows: 

13030 comprises the T. reesei BGL2 gene cloned behind the maize 7-zein promoter for 
cytoplasmic localization and expression specifically in the endosperm. 

13031 comprises the maize 7-zein N-terminal signal peptide (MRVLLVALALLALAASATS) 
fused to the T. reesei BGL2 gene for targeting to the endoplasmic reticulum and secretion into 
the apoplast. The fusion was cloned behind the maize 7-zein promoter for expression 
specifically in the endosperm. 

13032 comprises the maize 7-zein N-terminal signal peptide fused to the T. reesei BGL2 gene 
with a C-terminal addition of the sequence KDEL for targeting to and retention in the 
endoplasmic reticulum. The fusion was cloned behind the maize ^zein promoter for expression 
specifically in the endosperm. 

13033 comprises the maize Granule Bound Starch Synthase I (GBSSI) N-terminal signal peptide 
(N-terminal 77 amino acids) fused to the T. reesei BGL2 gene for targeting to the lumen of the 
amyloplast. The fusion was cloned behind the maize 7-zein promoter for expression specifically 
in the endosperm. 

13034 comprises the maize GBSSI N-terminal signal peptide fused to the T. reesei BGL2 gene 

with a C-terminal addition of the starch binding domain (C-terminal 301 amino acids) of the 

159 



WO 2005/096804 



PCT/US2004/007182 



maize GBSSI gene for targeting to the starch granule. The fusion was cloned behind the maize 
7-zein promoter for expression specifically in the endosperm. 

Additional Expression cassettes are generated by substituting a maize optimized version 

of BGL2 (SEQ ID NO: 96). 

All expression cassettes are inserted into the binary vector pNOV21 17 for transformation 
into maize via Agrobacterium infection. The binary vector contained the phosphomannose 
isomerase (PMI) gene which allows for selection of transgenic cells with mannose. Transformed 
maize plants were either self-pollinated or outcrossed and seed was collected for analysis. 

BGL2 Enzyme Assays 

BGL2 enzyme activity is measured in transgenic maize using a protocol modified from 
Bauer and Kelly (Bauer, M.W. and Kelly, R.M. 1998. The family 1 0-glucosidases from 
Pyrococcus furiosus and Agrobacterium faecalis share a common catalytic mechanism. 
Biochemistry 37: 17170-17178). The protocol can be modified to incubate samples at 37°C 
instead of 100°C. The enzymatic activity of BGL2-expressors is tested in the Fiber Hydrolysis 
Assay. 
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Example 61 

fl-Glucosidase D 



The Trichoderma reesei jff-Glucosidase D (CEL3D) gene was amplified and cloned by 
PCR based on a published database sequence (accession # AY281378; Foreman et al, 2003). 
Because only genomic sequences could be obtained, the cDNA was generated from the genomic 
sequence by removing an intron using Overlap PCR, as described in Example 58. The resulting 
cDNA (SEQ ID NO: 91) may be used for subsequent constructs. A maize optimised version 
(SEQ ID NO: 97) of the resulting cDNA may also be used for constructs. 

Plant constructs can be generated and £-glucosidase assays can be performed as described 
for BGL2 in Example 60, replacing BGL2 with CEL3D. 
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Example 62 

Lipases 

cDNAs encoding lipases are generated using sequences from Accession # D85895, 
AF04488, and AF04489 (Tsuchiya et al., 1996; Yu et aL, 2003) and methodology set forth in 
Examples 59-60. 

Lipase enzyme activity can be measured in transgenic maize using the Fluorescent Lipase 
Assay Kit (Cat # M0612)(Marker Gene Technologies, Inc.). Lipase activity can also be 
measured in vivo using the fluorescent substrate l,2-dioleoyl-3-(pyren-l-yl)decanoyl-rac 
glycerol (M0258), also from Marker Gene Technologies, Inc. 

Example 63 

Expression of Phytase in Rice 

Vectors 1 1267 and 1 1268 comprise binary vectors that encode Nov9x phytase. 
Expression of the Nov9x phytase gene in both vectors is under the control of the rice glutelin-1 
promoter (SEQ ID NO:67). Vectors 1 1267 and 1 1268 are derived from pNOV21 17. 

The Nov9x phytase expression cassette in vector 1 1267 comprises the rice glutelin-1 
promoter, the Nov9x phytase gene with apoplast targeting signal, a PEPC intron, and the 35S 
terminator. The product of the Nov9x phytase coding sequence in vector 1 1267 is shown in SEQ 
ID NO: 110. 

The Nov9x phytase expression cassette in vector 1 1268 comprises the rice glutelin-1 
promoter, the Nov9x phytase gene with ER retention (SEQ ID NO:l 1 1), a PEPC intron, and the 
35S terminator. The product of the Nov9x phytase coding sequence in vector 1 1268 is shown in 
SEQ ID NO: 112. 
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11267 Nov9x phytase with apoplast targeting DNA sequence (SEQ ID NO: 109). 
Translation start and stop codons are underlined. The sequence encoding the signal 

sequence of the 27-kD gamma-zein protein is in bold. 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccaccagcgctgcgcagtccgagccggagctgaagctgg 

agtccgtggtgatcgtgtcccgccacggcgtgcgcgccccgaccaaggccacccagctcatgcaggacgtgaccccggacgcctggcc 

gacctggccggtgaagctcggcgagctgaccccgcgcggcggcgagctgatcgcctacctcggccactactggcgccagcgcctcgtg 

gccgacggcctcctcccgaagtgcggctgcccgcagtccggccaggtggccatcatcgccgacgtggacgagcgcacccgcaagacc 

ggcgaggccttcgccgccggcctcgccccggactgcgccatcaccgtgcacacccaggccgacacctcctccccggacccgctcttcaa 

cccgctcaagaccggcgtgtgccagctcgacaacgccaacgtgaccgacgccatcctggagcgcgccggcggctccatcgccgacttc 

accggccactaccagaccgccttccgcgagctggagcgcgtgctcaacttcccgcagtccaacctctgcctcaagcgcgagaagcagga 

cgagtcctgctccctcacccaggccctcccgtccgagctgaaggtgtccgccgactgcgtgtccctcaccggcgccgtgtccctcgcctcc 

atgctcaccgaaatcttcctcctccagcaggcccagggcatgccggagccgggctggggccgcatcaccgactcccaccagtggaacac 

cctcctctccctccacaacgcccagttcgacctcctccagcgcaccccggaggtggcccgctcccgcgccaccccgctcctcgacctcatc 

aagaccgccctcaccccgcacccgccgcagaagcaggcctacggcgtgaccctcccgacctccgtgctcttcatcgccggccacgacac 

caacctcgccaacctcggcggcgccctggagctgaactggaccctcccgggccagccggacaacaccccgccgggcggcgagctggt 

gttcgagcgctggcgccgcctctccgacaactcccagtggattcaggtgtccctcgtgttccagaccctccagcagatgcgcgacaagacc 

ccgctctccctcaacaccccgccgggcgaggtgaagctcaccctcgccggctgcgaggagcgcaacgcccagggcatgtgctccctcg 

ccggcttcacccagatcgtgaacgaggcccgcatcccggcctgctccctctaa 



11267 Nov9x phytase with apoplast targeting gene product (SEQ ID NO:110). The signal 

sequence of the 27-kD gamma-zein protein is in bold. 

mi^Ilvalallalaasatsaaqsepelkleswivsrhgvraptkatqlmqdvtpdawptwpvklgeltprg^ 

dgllpkcgcpqsgqvaiiadvdertrktgeafaaglapdcait^ 

qtafrelervlnfpqsnlclkrekqdescsltqalpselkvsadcvsltgavslasmlteifllqqaqgmpepgw 
aqfdllqrtpevarsratplldliktaltphppqkqaygvtlp^ 

sqwiqvslvfqtlqqmrdktplslntppgevkltlagceernaqgmcslagftqivnearipacsl 

11268 Nov9x phytase with ER retention DNA sequence (SEQ ID NOrlll). The sequence 
encoding the signal sequence of the 27-kD gamma-zein protein is in bold. The sequence 

encoding the SEKDEL hexapeptide ER retention signal is underlined. 

atgagggtgttgctcgttgccctcgctctcctggctctcgctgcgagcgccaccagcgctgcgcagtccgagccggagctgaagctgg 

agtccgtggtgatcgtgtcccgccacggcgtgcgcgccccgaccaaggccacccagctcatgcaggacgtgaccccggacgcctggcc 

gacctggccggtgaagctcggcgagctgaccccgcgcggcggcgagctgatcgcctacctcggccactactggcgccagcgcctcgtg 

gccgacggcctcctcccgaagtgcggctgcccgcagtccggccaggtggccatcatcgccgacgtggacgagcgcacccgcaagacc 

ggcgaggccttcgccgccggcctcgccccggactgcgccatcaccgtgcacacccaggccgacacctcctccccggacccgctcttcaa 

cccgctcaagaccggcgtgtgccagctcgacaacgccaacgtgaccgacgccatcctggagcgcgccggcggctccatcgccgacttc 

accggccactaccagaccgccttccgcgagctggagcgcgtgctcaacttcccgcagtccaacctctgcctcaagcgcgagaagcagga 

cgagtcctgctccctcacccaggccctcccgtccgagctgaaggtgtccgccgactgcgtgtccctcaccggcgccgtgtccctcgcctcc 

atgctcaccgaaatcttcctcctccagcaggcccagggcatgccggagccgggctggggccgcatcaccgactcccaccagtggaacac 

cctcctctccctccacaacgcccagttcgacctcctccagcgcaccccggaggtggcccgctcccgcgccaccccgctcctcgacctcatc 

aagaccgccctcaccccgcacccgccgcagaagcaggcctacggcgtgaccctcccgacctccgtgctcttcatcgccggccacgacac 

caacctcgccaacctcggcggcgccctggagctgaactggaccctcccgggccagccggacaacaccccgccgggcggcgagctggt 

gttcgagcgctggcgccgcctctccgacaactcccagtggattcaggtgtccctcgtgttccagaccctccagcagatgcgcgacaagacc 

163 



WO 2005/096804 



PCT/US2004/007182 



ccgctctccctcaacaccccgccgggcgaggtgaagctcaccctcgccggctgcgaggagcgcaacgcccagggcatgtgctccctcg 
ccggcttcacccagatcgtgaacgaggcccgcatcccggcctgctccctctccgagaaggacgagctgtaa 

11268 Nov9x phytase with ER retention, gene product (SEQ ID NO: 112). The signal 
sequence of the 27-kD gamma-zein protein is in bold. The ER retention signal is 

underlined. 

m™ilvalallalaasatsaaqsepelklesvvivsrhgvraptkatqlm 

dgllpkcgcpqsgqvaiiadvdertrktgeafaaglapdcaitvhtqadtsspdplfhplktgycqldnanvtdaileraggsiadft 
qtafrelervlnfpqsnlclkrekqde 

aqfdllqrtpevarsratplldliktaltphppqkqaygvtlptsvlfiaghdtnlanlggaleln^ 
sqwiqvslvfqtlqqmrdktplslntppgevkltlagceemaqgmc^^ 

Generation of transgenic rice plants 

Rice (Oryza sativa) is used for generating transgenic plants. Various rice cultivars can be 
used (Hiei et al., 1994, Plant Journal 6:271-282; Dong et al., 1996, Molecular Breeding 2:267- 
276; Hiei et al., 1997, Plant Molecular Biology, 35:205-218). Also, the various media 
constituents described below may be either varied in concentration or substituted. Embryogenic 
responses are initiated and/or cultures are established from mature embryos by culturing on MS- 
C1M medium (MS basal salts, 4.3 g/liter; B5 vitamins (200 x), 5 ml/liter; Sucrose, 30 g/liter; 
proline, 500 mg/liter; glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2,4-D (1 mg/ml), 
2 ml/liter; adjust pH to 5.8 with 1 N KOH; Phytagel, 3 g/liter). Either mature embryos at the 
initial stages of culture response or established culture lines are inoculated and co-cultivated with 
the Agrobacterium strain LBA4404 containing the desired vector construction. Agrobacterium is 
cultured from glycerol stocks on solid YPC medium (100 mg/L spectinomycin and any other 
appropriate antibiotic) for ~2 days at 28 °C. Agrobacterium is re-suspended in liquid MS-CIM 
medium. The Agrobacterium culture is diluted to an OD600 of 0.2-0.3 and acetosyringone is 
added to a final concentration of 200 uM. Agrobacterium is induced with acetosyringone before 
mixing the solution with the rice cultures. For inoculation, the cultures are immersed in the 
bacterial suspension. The liquid bacterial suspension is removed and the inoculated cultures are 
placed on co-cultivation medium and incubated at 22°C for two days. The cultures are then 
transferred to MS-CIM medium with Ticarcillin (400 mg/liter) to inhibit the growth of 
Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., In Vitro 
Cell. Dev. Biol.-Plant 37:127-132), cultures are transferred to selection medium containing 
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Mannose as a carbohydrate source (MS with 2%Mannose, 300 mg/liter Ticarcillin) after 7 days, 
and cultured for 3-4 weeks in the dark. Resistant colonies are then transferred to regeneration 
induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 1 mg/liter zeatin, 200 mg/liter 
Ticarcillin 2% Mannose and 3% Sorbitol) and grown in the dark for 14 days. Proliferating 
colonies are then transferred to another round of regeneration induction media and moved to the 
light growth room. Regenerated shoots are transferred to GA7-1 medium (MS with no hormones 
and 2% Sorbitol) for 2 weeks and then moved to the greenhouse when they are large enough and 
have adequate roots. Plants are transplanted to soil in the greenhouse and grown to maturity. 

Example 64 

Analysis of Transgenic Rice Seed Expressing Nov9X Phvtase 

ELISA For The Quantitation Of Nov9X Phytase From Rice Seed 

Quantitation of phytase expressed in transgenic rice seed was assayed by ELISA. One 
(lg) rice seed was ground to flour in a Kleco seed grinder. 50 mg of flour was resuspended in 
the sodium acetate buffer described in example - for Nov9X phytase activity assay and diluted 
as required for the immunoassay. The Nov9X immunoassay is a quantitative sandwich assay for 
the detection of phytase that employs two polyclonal antibodies. The rabbit antibody was 
purified using protein A, and the goat antibody was immunoaffinity purified against recombinant 
phytase (Nov9X) protein produced in Exoli inclusion bodies. Using these highly specific 
antibodies, the assay can measure picogram levels of phytase in transgenic plants. There are three 
basic parts to the assay. The phytase protein in the sample is captured onto the solid phase 
microtiter well using the rabbit antibody. Then a "sandwich" is formed between the solid phase 
antibody, the phytase protein, and the secondary antibody that has been added to the well. After 
a wash step, where unbound secondary antibody has been removed, the bound antibody is 
detected using an alkaline phosphatase-labeled antibody. Substrate for the enzyme is added and 
color development is measured by reading the absorbance of each well. The standard curve uses 
a four-parameter curve fit to plot the concentrations versus the absorbance. 

Phytase activity assay 

165 



WO 2005/096804 



PCT/US2004/007182 



Determination of phytase activity, based upon the estimation of inorganic phosphate 
released on hydrolysis of phytic acid, can be performed at 37°C following the method of 
Engelen, A.J. et al., J. AOAC Inter., 84 , 629 (2001). One unit of enzyme activity is defined as 
the amount of enzyme that liberates 1 jamol of inorganic phosphate per minute under assay 
conditions. For example, phytase activity may be measured by incubating 2.0 ml of the enzyme 
preparation with 4.0 ml of 9.1 mM sodium phytate in 250 mM sodium acetate buffer pH 5.5, 
supplemented with 1 mM CaC12 for 60 minutes at 37°C. After incubation, the reaction is 
stopped by adding 4.0 ml of a color-stop reagent consisting of equal parts of a 10% (w/v) 
ammonium molybdate and a 0.235% (w/v) ammonium vanadate stock solution. Precipitate is 
removed by centrifiigation, and phosphate released is measured against a set of phosphate 
standards spectrophotometrically at 415 nm. Phytase activity is calculated by interpolating the 
A415 nm absorbance values obtained for phytase containing samples using the generated 
phosphate standard curve. 

This procedure may be scaled down to accommodate smaller volumes and adapted to 
preferred containers. Preferred containers include glass test tubes and plastic microplates. 
Partial submersion of the reaction vessel(s) in a water bath is essential to maintain constant 
temperature during the enzyme reaction. 



Table 24 



Trans-genic 
line 


phytase/g 
flour* 


Phytase activity 
units per g flour** 


Endogenous inorganic 
phosphate released by 
cooking of dehusked rice 
seed (nmol/gseed) 


Endogenous inorganic 
phosphate released by cooking 
of dehusked, polished rice 
seed (umol/gseed) 


Wild type 


0 


0 


1.442 


0.469 


1 


510 


916 


1.934 


0.840 


2 


1518 


2800 


2.894 


1.073 



*|ig phytase was assayed by a sandwich ELISA 

**Phytase activity was assayed by Phytase activity assay as described above. 



Assay of Inorganic Phosphate Release During Cooking of Transgenic Rice Expressing Phytase 
Two samples of lg seed from selected rice transgenic lines and a control wildtype line 
was dehusked using a benchtop Kett TR200 automatic rice husker . One sample was then 



166 



WO 2005/096804 



PCT/US2004/007182 



polished for 30 seconds in a Kett Rice polisher. Two volumes of H20 was added to each 
sample and the rice was cooked by immersing the tubes into a beaker of water. The water was 
brought to a boil and held in a full rolling boil for 10 minutes. The "cooked" rice seed was then 
ground to a paste with water bringing the total volume of te slurry to 6 ml. The slurry was 
centrifuged at 15,000xg for 10 minutes and the clear supernatant assayed for released, 
endogenous inorganic phosphate. The assay of released phosphate is based on color formation as 
a result of molybdate and vanadate ions complexing with inorganic phosphate and is measured 
spectrophotometrically at 415nm as described in example - for phytase enzymatic activity. The 

results are in Table 24. 

All publications, patents and patent applications are incorporated herein by 
reference. While in the foregoing specification this invention has been described in relation to 
certain preferred embodiments thereof, and many details have been set forth for purposes of 
illustration, it will be apparent to those skilled in the art that the invention is susceptible to 
additional embodiments and that certain of the details described herein may be varied 
considerably without departing from the basic principles of the invention. 

SEQUENCE LISTING 
<110> Lanahan , Mike 

<120> Self -processing Plants and Plant Parts 

<130> 109846.317 

<140> US 60/315,281 
<141> 2001-08-27 

<160> 60 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 436 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 1 
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Met 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 


Val 


He 


Met 


Gin 


Ala 


1 




5 










10 










15 




Phe 


Tyr 

mm 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 


Asp 


Thr 


He 


Arg 




20 










25 










30 






Gin 


Lys 

mm 


He 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


He 


Ser 


Ala 


He 


Trp 


He 




35 










40 










45 








Pro 


Pro 
50 


Ala 


Ser 


Lys 


Gly 


Met 
55 


Ser 


Gly 


Gly 


Tyr 


Ser 
60 


Met 


Gly 


Tyr 


Asp 


Pro 


Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


65 








70 










75 










80 


Glu 


Thr 


Arg 


Phe 


Gly 
85 


Ser 


Lys 


Gin 


Glu 


Leu 
90 


He 


Asn 


Met 


He 


Asn 
95 


Thr 


Ala 


His 


Ala 


Tyr 

mm 

100 


Gly 


He 


Lys 


Val 


He 
105 


Ala 


Asp 


He 


Val 


He 
110 


Asn 


His 


Arg 


Ala 


Gly 


Gly 


Asp 

mm 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 


Tyr 


Thr 




115 










120 










125 








Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


Asn 


Tyr 


130 










135 










140 










Leu 


Asp 

m\T 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


145 








150 










155 










160 


Gly 


Gly 


Tyr 

m* 


Pro 


Asp 


He 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 






165 










170 










175 




Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


He 


Gly 






180 










185 










190 






lie 


Asp 


Ala 


Trp 


Arg 

mm* 


Phe 


Asp 


Tyr 


Val 


Lys 


Gly 


Tyr 


Gly 


Ala 


Trp 


Val 




195 










200 










205 








Val 


Lys 
210 


Asp 


Trp 


Leu 


Asn 


Trp 
215 


Trp 


Gly 


Gly 


Trp 


Ala 
220 


Val 


Gly 


Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


225 








230 










235 










240 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 








245 










250 










255 




Asp 


Asn 


Lys 


Asn 


He 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 


Gly 


Gly 




260 










265 










270 






Thr 


Val 


Val 
275 


Ser 


Arg 


Asp 


Pro 


Phe 
280 


Lys 


Ala 


Val 


Thr 


Phe 
285 


Val 


Ala 


Asn 


His 


Asp 
290 


Thr 


Asp 


He 


He 


Trp 
295 


Asn 


Lys 


Tyr 


Pro 


Ala 
300 


Tyr 


Ala 


Phe 


He 


Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 


Glu 


Glu 


305 








310 










315 










320 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 


Leu 


He 


Trp 


He 


His 


Asp 


Asn 








325 










330 










335 




Leu 


Ala 


Gly 


Gly 


Ser 


Thr 


Ser 


He 


Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 


Glu 


Met 






340 










345 










350 






lie 


Phe 


Val 
355 


Arg 


Asn 


Gly 


Tyr 


Gly 
360 


Ser 


Lys 


Pro 


Gly 


Leu 

365 


He 


Thr 


Tyr 


He 


Asn 
370 


Leu 


Gly 


Ser 


Ser 


Lys 
375 


Val 


Gly 


Arg 


Trp 


Val 
380 


Tyr 


Val 


Pro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


He 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 


Gly 


Trp 


385 








390 










395 










400 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 








405 










410 










415 




Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 




420 










425 










430 
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Cys Gly Val Gly 
435 



<210> 2 
<211> 1308 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 2 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg gcatgtccgg gggctactcg 180 
atgggctacg acccgtacga ctacttcgac ctcggcgagt actaccagaa gggcacggtg 240 
gagacgcgct tcgggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 
ggcatcaagg tcatcgcgga catcgtgatc aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg tcggcgacta cacctggacg gacttctcca aggtcgcctc cggcaagtac 420 
accgccaact acctcgactt ccaccccaac gagctgcacg cgggcgactc cggcacgttc 480 
ggcggctacc cggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc atcggcatcg acgcgtggcg cttcgactac 600 
gtcaagggct acggggcctg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggccta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 
atcccggcgc tcgtcgaggc cc tgaagaac ggcggcacgg tggtctcccg cgacccgttc 84 0 
aaggccgtga ccttcgtcgc caaccacgac acggacatca tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta cgagggccag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct cggcggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtacggcta ctccgtgtgg tcctactgcg gcgtcggc 1308 

<210> 3 
<211> 800 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 3 

Met Gly His Trp Tyr Lys His Gin Arg Ala Tyr Gin Phe Thr Gly Glu 

1 5 10 15 

Asp Asp Phe Gly Lys Val Ala Val Val Lys Leu Pro Met Asp Leu Thr 

20 25 30 

Lys Val Gly He He Val Arg Leu Asn Glu Trp Gin Ala Lys Asp Val 

35 40 45 

Ala Lys Asp Arg Phe He Glu He Lys Asp Gly Lys Ala Glu Val Trp 

50 55 60 

He Leu Gin Gly Val Glu Glu lie Phe Tyr Glu Lys Pro Asp Thr Ser 
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65 

Pro Arg lie Phe 

Phe Leu Thr Asn 

100 

Thr Val Asp Gly 
115 

Pro Thr Asp lie 
130 

Ser Leu Lys Glu 
145 

Gly Tyr Lys Pro 

Tyr Tyr Asp Gly 

180 

Phe Arg Val Trp 
195 

Lys Asn Gly Glu 
210 

Lys Gly Asn Gly 
225 

Val Phe Tyr Leu 

Val Asp Pro Tyr 

260 

Val Val Asn Leu 
275 

Gly Pro Lys lie 
290 

lie Ala Asp lie 
305 

Leu Tyr Leu Gly 

Thr Thr Gly Leu 

340 

lie Leu Pro Phe 
355 

Phe Glu Lys Tyr 
370 

Pro Glu Gly Arg 
385 

Arg Glu Val Lys 

Val lie Met Asp 

420 

Ser Ala Phe Asp 
435 

Thr Gly Ala Tyr 
450 

Glu Arg Pro Met 
465 

Val Lys Glu Tyr 
lie Asp Lys Lys 



70 

Phe Ala Gin Ala 
85 

Pro Val Asp Thr 

Lys Glu lie Pro 

120 

Asp Val Thr Asn 
135 

Glu Asp Leu Arg 
150 

Ala Arg Val lie 
165 

Glu Leu Gly Ala 

Ser Pro Val Ser 

200 

Asp Thr Glu Pro 
215 

Val Trp Glu Ala 
230 

Tyr Gin Leu Glu 
245 

Ser Lys Ala Val 

Ala Arg Thr Asn 

280 

Glu Gly Tyr Glu 
295 

Thr Gly Leu Glu 
310 

Leu Thr Glu Glu 
325 

Ser His Leu Val 

Phe Asp Phe Tyr 

360 

Tyr Asn Trp Gly 
375 

Tyr Ser Thr Asp 
390 

Glu Met Val Lys 
405 

Met Val Phe Pro 

Gin Thr Val Pro 

440 

Leu Asn Glu Ser 
455 

Met Arg Lys Phe 
470 

His He Asp Gly 
485 

Thr Met Leu Glu 



75 

Arg Ser Asn Lys 
90 

Lys Lys Lys Glu 
105 

Val Ser Arg Val 

Tyr Val Arg He 

140 

Lys Asp Val Glu 
155 

Met Met Glu He 
170 

Val Tyr Ser Pro 
185 

Lys Trp Val Lys 

Tyr Gin Val Val 

220 

Val Val Glu Gly 
235 

Asn Tyr Gly Lys 
250 

Tyr Ala Asn Asn 
265 

Pro Glu Gly Trp 

Asp Ala He He 

300 

Asn Ser Gly Val 
315 

Asn Thr Lys Gly 
330 

Glu Leu Gly Val 
345 

Thr Gly Asp Glu 

Tyr Asp Pro Tyr 

380 

Pro Lys Asn Pro 
395 

Ala Leu His Lys 
410 

His Thr Tyr Gly 
425 

Tyr Tyr Phe Tyr 

Gly Cys Gly Asn 

460 

He Val Asp Thr 
475 

Phe Arg Phe Asp 
490 

Val Glu Arg Ala 



80 

Val He Glu Ala 
95 

Leu Phe Lys Val 
110 

Glu Lys Ala Asp 
125 

Val Leu Ser Glu 

Leu He He Glu 

160 

Leu Asp Asp Tyr 
175 

Glu Lys Thr He 
190 

Val Leu Leu Phe 
205 

Asn Met Glu Tyr 

Asp Leu Asp Gly 

240 

He Arg Thr Thr 
255 

Gin Glu Ser Ala 

270 

Glu Asn Asp Arg 
285 

Tyr Glu He His 

Lys Asn Lys Gly 

320 

Pro Gly Gly Val 
335 

Thr His Val His 
350 

Leu Asp Lys Asp 
365 

Leu Phe Met Val 

His Thr Arg He 

400 

His Gly He Gly 
415 

He Gly Glu Leu 
430 

Arg lie Asp Lys 
445 

Val He Ala Ser 

Val Thr Tyr Trp 

480 

Gin Met Gly Leu 
495 

Leu His Lys He 
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500 










505 










510 






Asp 


Pro 


Thr 


He 


He 


Leu 


Tyr 


Gly 


Glu 


Pro 


Trp 


Gly 


Gly 


Trp 


Gly 


Ala 




515 










520 










525 








Pro 


He 


Arg 


Phe 


Gly 


Lys 


Ser 


Asp 


Val 


Ala 


Gly 


Thr 


His 


Val 


Ala 


Ala 




530 










535 










540 










Phe 


Asn 


Asp 


Glu 


Phe 


Arg 


Asp 


Ala 


He 


Arg 


Gly 


Ser 


Val 


Phe 


Asn 


Pro 


545 










550 










555 










560 


Ser 


Val 


Lys 


Gly 


Phe 


Val 


Met 


Gly 


Gly 


Tyr 


Gly 


Lys 


Glu 


Thr 


Lys 


He 










565 










570 










575 




Lys 


Arg 


Gly 


Val 


Val 


Gly 


Ser 


He 


Asn 


Tyr 


Asp 


Gly 


Lys 


Leu 


He 


Lys 








580 










585 










590 






Ser 


Phe 


Ala 


Leu 


Asp 


Pro 


Glu 


Glu 


Thr 


He 


Asn 


Tyr 


Ala 


Ala 


Cys 


His 






595 










600 










605 








Asp 


Asn 


His 


Thr 


Leu 


Trp 


Asp 


Lys 


Asn 


Tyr 


Leu 


Ala 


Ala 


Lys 


Ala 


Asp 




610 










615 










620 










Lys 


Lys 


Lys 


Glu 


Trp 


Thr 


Glu 


Glu 


Glu 


Leu 


Lys 


Asn 


Ala 


Gin, Lys 


Leu 


625 










630 










635 










640 


Ala 


Gly 


Ala 


He 


Leu 


Leu 


Thr 


Ser 


Gin 


Gly 


Val 


Pro 


Phe 


Leu 


His 


Gly 








645 










650 










655 




Gly 


Gin 


Asp 


Phe 


Cys 


Arg 


Thr 


Thr 


Asn 


Phe 


Asn 


Asp 


Asn 


Ser 


Tyr 


Asn 








660 










665 










670 






Ala 


Pro 


He 


Ser 


He 


Asn 


Gly 


Phe 


Asp 


Tyr 


Glu 


Arg 


Lys 


Leu 


Gin 


Phe 






675 










680 










685 








lie 


Asp 


Val 


Phe 


Asn 


Tyr 


His 


Lys 


Gly 


Leu 


He 


Lys 


Leu 


Arg 


Lys 


Glu 




690 










695 










700 










His 


Pro 


Ala 


Phe 


Arg 


Leu 


Lys 


Asn 


Ala 


Glu 


Glu 


He 


Lys 


Lys 


His 


Leu 


705 










710 










715 










720 


Glu 


Phe 


Leu 


Pro 


Gly 


Gly 


Arg 


Arg 


He 


Val 


Ala 


Phe 


Met 


Leu 


Lys 


Asp 










725 










730 










735 




His 


Ala 


Gly 


Gly 


Asp 


Pro 


Trp 


Lys 


Asp 


He 


Val 


Val 


He 


Tyr 


Asn 


Gly 








740 










745 










750 






Asn 


Leu 


Glu 


Lys 


Thr 


Thr 


Tyr 


Lys 


Leu 


Pro 


Glu 


Gly 


Lys 


Trp 


Asn 


Val 






755 










760 










765 








Val 


Val 


Asn 


Ser 


Gin 


Lys 


Ala 


Gly 


Thr 


Glu 


Val 


He 


Glu 


Thr 


Val 


Glu 




770 










775 










780 










Gly 


Thr 


He 


Glu 


Leu 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


Val 


Leu 


Tyr 


Arg 


Glu 


785 










790 










795 










800 



<210> 4 
<211> 2400 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 4 

atgggccact ggtacaagca ccagcgcgcc taccagttca ccggcgagga cgacttcggg 60 
aaggtggccg tggtgaagct cccgatggac ctcaccaagg tgggcatcat cgtgcgcctc 120 
aacgagtggc aggcgaagga cgtggccaag gaccgcttca tcgagatcaa ggacggcaag 18 0 
gccgaggtgt ggatactcca gggcgtggag gagatcttct acgagaagcc ggacacctcc 240 
ccgcgcatct tcttcgccca ggcccgctcc aacaaggtga tcgaggcctt cctcaccaac 300 
ccggtggaca ccaagaagaa ggagctgttc aaggtgaccg tcgacggcaa ggagatcccg 360 
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gtgtcccgcg tggagaaggc cgacccgacc gacatcgacg tgaccaacta cgtgcgcatc 420 
gtgctctccg agtccctcaa ggaggaggac ctccgcaagg acgtggagct gatcatcgag 480 
ggctacaagc cggcccgcgt gatcatgatg gagatcctcg acgactacta ctacgacggc 540 
gagctggggg cggtgtactc cccggagaag accatcttcc gcgtgtggtc cccggtgtcc 600 
aagtgggtga aggtgctcct cttcaagaac ggcgaggaca ccgagccgta ccaggtggtg 660 
aacatggagt acaagggcaa cggcgtgtgg gaggccgtgg tggagggcga cctcgacggc 72 0 
gtgttctacc tctaccagct ggagaactac ggcaagatcc gcaccaccgt ggacccgtac 780 
tccaaggccg tgtacgccaa caaccaggag tctgcagtgg tgaacctcgc ccgcaccaac 84 0 
ccggagggct gggagaacga ccgcggcccg aagatcgagg gctacgagga cgccatcatc 900 
tacgagatcc acatcgccga catcaccggc ctggagaact ccggcgtgaa gaacaagggc 960 
ctctacctcg gcctcaccga ggagaacacc aaggccccgg gcggcgtgac caccggcctc 1020 
tcccacctcg tggagctggg cgtgacccac gtgcacatcc tcccgttctt cgacttctac 1080 
accggcgacg agctggacaa ggacttcgag aagtactaca actggggcta cgacccgtac 1140 
ctcttcatgg tgccggaggg ccgctactcc accgacccga agaacccgca cacccgaatt 1200 
cgcgaggtga aggagatggt gaaggccctc cacaagcacg gcatcggcgt gatcatggac 1260 
atggtgttcc cgcacaccta cggcatcggc gagctgtccg ccttcgacca gaccgtgccg 1320 
tactacttct accgcatcga caagaccggc gcctacctca acgagtccgg ctgcggcaac 1380 
gtgatcgcct ccgagcgccc gatgatgcgc aagttcatcg tggacaccgt gacctactgg 1440 
gtgaaggagt accacatcga cggcttccgc ttcgaccaga tgggcctcat cgacaagaag 1500 
accatgctgg aggtggagcg cgccctccac aagatcgacc cgaccatcat cctctacggc 1560 
gagccgtggg gcggctgggg ggccccgatc cgcttcggca agtccgacgt ggccggcacc 1620 
cacgtggccg ccttcaacga cgagttccgc gacgccatcc gcggctccgt gttcaacccg 1680 
tccgtgaagg gcttcgtgat gggcggctac ggcaaggaga ccaagatcaa gcgcggcgtg 1740 
gtgggctcca tcaactacga cggcaagctc atcaagtcct tcgccctcga cccggaggag 1800 
accatcaact acgccgcctg ccacgacaac cacaccctct gggacaagaa ctacctcgcc 1860 
gccaaggccg acaagaagaa ggagtggacc gaggaggagc tgaagaacgc ccagaagctc 192 0 
gccggcgcca tcctcctcac tagtcagggc gtgccgttcc tccacggcgg ccaggacttc 1980 
tgccgcacca ccaacttcaa cgacaactcc tacaacgccc cgatctccat caacggcttc 2040 
gactacgagc gcaagctcca gttcatcgac gtgttcaact accacaaggg cctcatcaag 2100 
ctccgcaagg agcacccggc cttccgcctc aagaacgccg aggagatcaa gaagcacctg 2160 
gagttcctcc cgggcgggcg ccgcatcgtg gccttcatgc tcaaggacca cgccggcggc 2220 
gacccgtgga aggacatcgt ggtgatctac aacggcaacc tggagaagac cacctacaag 2280 
ctcccggagg gcaagtggaa cgtggtggtg aactcccaga aggccggcac cgaggtgatc 2340 
gagaccgtgg agggcaccat cgagctggac ccgctctccg cctacgtgct ctaccgcgag 2400 



<210> 5 
<211> 693 
<212> PRT 

<213> Sulfolobus solfataricus 



<400> 5 

Met Glu Thr lie 
1 

lie Gly Glu Pro 

20 

Ser Ser Asn Lys 
35 

Gly Asn Lys Val 
50 

lie Gly Leu Gly 
65 

Tyr Val Met Tyr 



Lys lie Tyr Glu 
5 

Phe Pro Pro lie 

Ser Leu Ser Glu 

40 

lie Val Glu Lys 
55 

Glu Lys Ala Phe 
70 

Asn Val Asp Ala 
85 



Asn Lys Gly Val 
10 

Glu Phe Pro Leu 
25 

Leu Gly Leu Thr 

Ser Leu Asp Leu 

60 

Glu Leu Asp Arg 
75 

Gly Ala Tyr Lys 
90 

172 



Tyr Lys Val Val 
15 

Glu Gin Lys lie 
30 

lie Val Gin Gin 
45 

Lys Glu His lie 

Lys Arg Lys Arg 

80 

Lys Tyr Gin Asp 
95 
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Pro Leu Tyr Val 

100 

Ala Thr Gly Tyr 
115 

Gly Leu Glu Glu 
130 

Val Glu Phe Tyr 
145 

Lys Tyr Thr Glu 

Phe Gly Tyr Met 

180 

Val Glu Leu Val 
195 

Val Phe Leu Asp 
210 

His Pro Tyr Arg 
225 

Lys Arg Asn Val 

Asp Gin Asn Tyr 

260 

lie Glu Ser Gly 
275 

Val Tyr Pro Asp 
290 

Leu He Ser Glu 
305 

Met Asn Glu Pro 

Leu Ser Ser Leu 

340 

Phe Pro Asp Asn 
355 

Glu Lys Val Arg 
370 

Lys Gly Phe Arg 
385 

Ala Gly Tyr Ala 

Asn Thr Pro Ser 

420 

Gly Leu Ser He 
435 

Phe Gin Gly Arg 
450 

Val Lys Tyr Tyr 
465 

Lys Ala Thr Asp 

Tyr Lys Glu Lys 

500 

Pro Tyr He Tyr 
515 



Ser He Pro Leu 

Phe Phe Asn Ser 

120 

Tyr Asp Lys Val 
135 

Val He Glu Gly 
150 

Leu Thr Gly Lys 
165 

He Ser Arg Tyr 

Asp He Met Gin 

200 

He His Tyr Met 
215 

Phe Pro Glu Pro 
230 

Lys Leu He Thr 
245 

Ser Pro Phe Leu 

Glu Leu Phe Val 

280 

Phe Phe Arg Glu 
295 

Trp Leu Ser Gin 
310 

Thr Asp Phe Ser 
325 

Pro Val Gin Phe 

Val Val His Tyr 

360 

Asn Ala Tyr Pro 
375 

Thr Ser His Arg 
390 

Gly He Gin Arg 
405 

Trp Asp Asp Leu 

Ser Gly Val Pro 

440 

Asn Phe Ala Glu 
455 

Ala Leu Ala Leu 
470 

Gly He Asp Thr 
485 

Val Lys Glu He 

Ser Leu Ala Leu 

520 



Phe He Ser Val 
105 

Ala Ser Lys Val 

He Val Thr He 

140 

Pro Arg He Glu 
155 

Pro Phe Leu Pro 
170 

Ser Tyr Tyr Pro 
185 

Lys Glu Gly Phe 

Asp Ser Tyr Lys 

220 

Lys Lys Leu He 
235 

He Val Asp His 
250 

Ser Gly Met Gly 
265 

Gly Lys Met Trp 

Asp Thr Arg Glu 

300 

Gly Val Asp Gly 
315 

Arg Ala He Glu 
330 

Arg Asp Asp Arg 
345 

Leu Arg Gly Lys 

Leu Tyr Glu Ala 

380 

Asn Glu He Phe 
395 

Tyr Ala Phe He 
410 

Lys Leu Gin Leu 
425 

Phe Val Gly Cys 

He Asp Asn Ser 

460 

Phe Phe Pro Phe 
475 

Glu Pro Val Phe 
490 

Val Glu Leu Arg 
505 

Glu Ala Ser Glu 
173 



Lys Asp Gly Val 
110 

He Phe Asp Val 
125 

Pro Glu Asp Ser 

Asp Val Leu Glu 

160 

Pro Met Trp Ala 
175 

Gin Asp Lys Val 
190 

Arg Val Ala Gly 
205 

Leu Phe Thr Trp 

Asp Glu Leu His 

240 

Gly He Arg Val 
255 

Lys Phe Cys Glu 
270 

Pro Gly Thr Thr 
285 

Trp Trp Ala Gly 

He Trp Leu Asp 

320 

lie Arg Asp Val 
335 

Leu Val Thr Thr 
350 

Arg Val Lys His 
365 

Met Ala Thr Phe 

He Leu Ser Arg 

400 

Trp Thr Gly Asp 
415 

Gin Leu Val Leu 
430 

Asp He Gly Gly 
445 

Met Asp Leu Leu 

Tyr Arg Ser His 

480 

Leu Pro Asp Tyr 
495 

Tyr Lys Phe Leu 
510 

Lys Gly His Pro 
525 
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Val 


He 


Arg 


Pro 


Leu 


Phe 


Tyr 


Glu 


Phe 


Gin 


Asp 


Asp 


Asp 


Asp 


Met 


Tyr 




530 








535 










540 










Arg 


He 


Glu 


Asp 


Glu 


Tyr 


Met 


Val 


Gly 


Lys 


Tyr 


Leu 


Leu 


Tyr 


Ala 


Pro 


545 










550 










555 










560 


He 


Val 


Ser 


Lys 


Glu 


Glu 


Ser 


Arg 


Leu 


Val 


Thr 


Leu 


Pro 


Arg 


Gly 


Lys 








565 










570 










575 




Trp 


Tyr 


Asn 


Tyr 


Trp 


Asn 


Gly 


Glu 


lie 


He 


Asn 


Gly 


Lys 


Ser 


Val 


Val 






580 










585 










590 






Lys 


Ser 


Thr 


His 


Glu 


Leu 


Pro 


He 


Tyr 


Leu 


Arg 


Glu 


Gly 


Ser 


He 


He 




595 










600 










605 








Pro 


Leu 


Glu 


Gly 


Asp 

Mm 


Glu 


Leu 


He 


Val 


Tyr 


Gly 


Glu 


Thr 


Ser 


Phe 


Lys 




610 










615 










620 










Arg 


Tyr 


Asp 


Asn 


Ala 


Glu 


lie 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 


He 


Lys 


Phe 


625 






630 










635 










640 


Ser 


Arg 


Glu 


lie 


Tyr 


Val 


Ser 


Lys 


Leu 


Thr 


lie 


Thr 


Ser 


Glu 


Lys 


Pro 










645 










650 










655 




Val 


Ser 


Lys 


lie 


He 


Val 


Asp 


Asp 


Ser 


Lys 


Glu 


lie 


Gin 


Val 


Glu 


Lys 








660 










665 










670 






Thr 


Met 


Gin 


Asn 


Thr 


Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 


lie 


Arg 


Gly 






675 










680 










685 








Lys 


He 


Asn 


Leu 


Glu 

























690 



<210> 6 
<211> 2082 
<212> DNA 

<213> Sulfolobus solfataricus 
<400> 6 

atggagacca tcaagatcta cgagaacaag ggcgtgtaca aggtggtgat cggcgagccg 60 
ttcccgccga tcgagttccc gctcgagcag aagatctcct ccaacaagtc cctctccgag 120 
ctgggcctca ccatcgtgca gcagggcaac aaggtgatcg tggagaagtc cctcgacctc 180 
aaggagcaca tcatcggcct cggcgagaag gccttcgagc tggaccgcaa gcgcaagcgc 24 0 
tacgtgatgt acaacgtgga cgccggcgcc tacaagaagt accaggaccc gctctacgtg 300 
tccatcccgc tcttcatctc cgtgaaggac ggcgtggcca ccggctactt cttcaactcc 360 
gcctccaagg tgatcttcga cgtgggcctc gaggagtacg acaaggtgat cgtgaccatc 42 0 
ccggaggact ccgtggagtt ctacgtgatc gagggcccgc gcatcgagga cgtgctcgag 4 80 
aagtacaccg agctgaccgg caagccgttc ctcccgccga tgtgggcctt cggctacatg 540 
atctcccgct actcctacta cccgcaggac aaggtggtgg agctggtgga catcatgcag 600 
aaggagggct tccgcgtggc cggcgtgttc ctcgacatcc actacatgga ctcctacaag 660 
ctcttcacct ggcacccgta ccgcttcccg gagccgaaga agctcatcga cgagctgcac 720 
aagcgcaacg tgaagctcat caccatcgtg gaccacggca tccgcgtgga ccagaactac 780 
tccccgttcc tctccggcat gggcaagttc tgcgagatcg agtccggcga gctgttcgtg 840 
ggcaagatgt ggccgggcac caccgtgtac ccggacttct tccgcgagga cacccgcgag 900 
tggtgggccg gcctcatctc cgagtggctc tcccagggcg tggacggcat ctggctcgac 960 
atgaacgagc cgaccgactt ctcccgcgcc atcgagatcc gcgacgtgct ctcctccctc 1020 
ccggtgcagt tccgcgacga ccgcctcgtg accaccttcc cggacaacgt ggtgcactac 1080 
ctccgcggca agcgcgtgaa gcacgagaag gtgcgcaacg cctacccgct ctacgaggcg 1140 
atggccacct tcaagggctt ccgcacctcc caccgcaacg agatcttcat cctctcccgc 1200 
gccggctacg ccggcatcca gcgctacgcc ttcatctgga ccggcgacaa caccccgtcc 1260 
tgggacgacc tcaagctcca gctccagctc gtgctcggcc tctccatctc cggcgtgccg 1320 
ttcgtgggct gcgacatcgg cggcttccag ggccgcaact tcgccgagat cgacaactcg 1380 
atggacctcc tcgtgaagta ctacgccctc gccctcttct tcccgttcta ccgctcccac 1440 
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aaggccaccg acggcatcga caccgagccg gtgttcctcc cggactacta -caaggagaag 1500 
gtgaaggaga tcgtggagct gcgctacaag ttcctcccgt acatctactc cctcgccctc 1560 
gaggcctccg agaagggcca cccggtgatc cgcccgctct tctacgagtt ccaggacgac 1620 
gacgacatgt accgcatcga ggacgagtac atggtgggca agtacctcct ctacgccccg 1680 
atcgtgtcca aggaggagtc ccgcctcgtg accctcccgc gcggcaagtg gtacaactac 1740 
tggaacggcg agatcatcaa cggcaagtcc gtggtgaagt ccacccacga gctgccgatc 1800 
tacctccgcg agggctccat catcccgctc gagggcgacg agctgatcgt gtacggcgag 1860 
acctccttca agcgctacga caacgccgag atcacctcct cctccaacga gatcaagttc 1920 
tcccgcgaga tctacgtgtc caagctcacc atcacctccg agaagccggt gtccaagatc 1980 
atcgtggacg actccaagga gatccaggtg gagaagacca tgcagaacac ctacgtggcc 2040 
aagatcaacc agaagatccg cggcaagatc aacctcgagt ga 2082 

<210> 7 
<211> 1818 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 7 

atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 
gcgtccacgt tccgccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 
gcggacacgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggcacca gcaccagcag 180 
gcgcgccgcg gggccaggtt cccgtcgctc gtcgtgtgcg ccagcgccgg catgaacgtc 240 
gtcttcgtcg gcgccgagat ggcgccgtgg agcaagaccg gaggcctcgg cgacgtcctc 300 
ggcggcctgc cgccggccat ggccgcgaac gggcaccgtg tcatggtcgt ctctccccgc 360 
tacgaccagt acaaggacgc ctgggacacc agcgtcgtgt ccgagatcaa gatgggagac 42 0 
gggtacgaga cggtcaggtt cttccactgc tacaagcgcg gagtggaccg cgtgttcgtt 480 
gaccacccac tgttcctgga gagggtttgg ggaaagaccg aggagaagat ctacgggcct 540 
gtcgctggaa cggactacag ggacaaccag ctgcggttca gcctgctatg ccaggcagca 600 
cttgaagctc caaggatcct gagcctcaac aacaacccat acttctccgg accatacggg 660 
gaggacgtcg tgttcgtctg caacgactgg cacaccggcc ctctctcgtg ctacctcaag 720 
agcaactacc agtcccacgg catctacagg gacgcaaaga ccgctttctg catccacaac 780 
atctcctacc agggccggtt cgccttctcc gactacccgg agctgaacct ccccgagaga 840 
ttcaagtcgt ccttcgattt catcgacggc tacgagaagc ccgtggaagg ccggaagatc 900 
aactggatga aggccgggat cctcgaggcc gacagggtcc tcaccgtcag cccctactac 960 
gccgaggagc tcatctccgg catcgccagg ggctgcgagc tcgacaacat catgcgcctc 1020 
accggcatca ccggcatcgt caacggcatg gacgtcagcg agtgggaccc cagcagggac 1080 
aagtacatcg ccgtgaagta cgacgtgtcg acggccgtgg aggccaaggc gctgaacaag 114 0 
gaggcgctgc aggcggaggt cgggctcccg gtggaccgga acatcccgct ggtggcgttc 12 00 
atcggcaggc tggaagagca gaagggcccc gacgtcatgg cggccgccat cccgcagctc 1260 
atggagatgg tggaggacgt gcagatcgtt ctgctgggca cgggcaagaa gaagttcgag 1320 
cgcatgctca tgagcgccga ggagaagttc ccaggcaagg tgcgcgccgt ggtcaagttc 1380 
aacgcggcgc tggcgcacca catcatggcc ggcgccgacg tgctcgccgt caccagccgc 1440 
ttcgagccct gcggcctcat ccagctgcag gggatgcgat acggaacgcc ctgcgcctgc 1500 
gcgtccaccg gtggactcgt cgacaccatc atcgaaggca agaccgggtt ccacatgggc 1560 
cgcctcagcg tcgactgcaa cgtcgtggag ccggcggacg tcaagaaggt ggccaccacc 1620 
ttgcagcgcg ccatcaaggt ggtcggcacg ccggcgtacg aggagatggt gaggaactgc 1680 
atgatccagg atctctcctg gaagggccct gccaagaact gggagaacgt gctgctcagc 1740 
ctcggggtcg ccggcggcga gccaggggtt gaaggcgagg agatcgcgcc gctcgccaag 1800 
gagaacgtgg ccgcgccc 1818 

<210> 8 
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<211> 606 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 8 



Met 


Ala 


Ala 


Leu 


Ala 


Thr 


Ser 


Gin 


Leu 


Val 


Ala 


Thr 


Arg 


Ala 


Gly 


Leu 


1 








5 










10 










15 




Gly 


Val 


Pro 


Asp 


Ala 


Ser 


Thr 


Phe 


Arg 


Arg 


Gly 


Ala 


Ala 


Gin 


Gly 


Leu 






20 










25 










30 






Arg 


Gly 


Ala 

35 


Arg 


Ala 


Ser 


Ala 


Ala 

40 


Ala 


Asp 


Thr 


Leu 


Ser 
45 


Met 


Arg 


Thr 


Ser 


Ala 
50 


Arq 


Ala 


Ala 


Pro 


Arq 
55 


His 


Gin 


His 


Gin 


Gin 
60 


Ala 


Arg 


Arg 


Gly 


Ala 


Arq 


Phe 


Pro 


Ser 


Leu 


Val 


Val 


Cys 


Ala 


Ser 


Ala 


Gly 


Met 


Asn 


Val 


65 








70 










75 










80 


Val 


Phe 


Val 


Gly 


Ala 
85 


Glu 


Met 


Ala 


Pro 


Trp 
90 


Ser 


Lys 


Thr 


Gly 


Gly 
95 


Leu 


Gly 


Asp 


Val 


Leu 
100 


Gly 


Gly 


Leu 


Pro 


Pro 
105 


Ala 


Met 


Ala 


Ala 


Asn 
110 


Gly 


His 


Arq 


Val 


Met 


Val 


Val 


Ser 


Pro 


Arg 


Tyr 


Asp 


Gin 


Tyr 


Lys 


Asp 


Ala 


Trp 




115 










120 










125 








ASD 


Thr 


Ser 


Val 


Val 


Ser 


Glu 


He 


Lys 


Met 


Gly 


Asp 


Gly 


Tyr 


Glu 


Thr 


130 










135 










140 










Val 


Arq 


Phe 


Phe 


His 


Cys 


Tyr 


Lys 


Arg 


Gly 


Val 


Asp 


Arg 


Val 


Phe 


Val 


145 










150 










155 










160 


Asp 


His 


Pro 


Leu 


Phe 


Leu 


Glu 


Arg 


Val 


Trp 


Gly 


Lys 


Thr 


Glu 


Glu 


Lys 








165 










170 










175 




lie 


Tyr 


Gly 


Pro 
180 


Val 


Ala 


Gly 


Thr 


Asp 
185 


Tyr 


Arg 


Asp 


Asn 


Gin 
190 


Leu 


Arg 


Phe 


Ser 


Leu 
195 


Leu 


Cys 


Gin 


Ala 


Ala 
200 


Leu 


Glu 


Ala 


Pro 


Arg 
205 


He 


Leu 


Ser 


Leu 


Asn 
210 


Asn 


Asn 


Pro 


Tyr 


Phe 
215 


Ser 


Gly 


Pro 


Tyr 


Gly 
220 


Glu 


Asp 


Val 


Val 


Phe 


Val 


Cys 


Asn 


Asp 


Trp 


His 


Thr 


Gly 


Pro 


Leu 


Ser 


Cys 


Tyr 


Leu 


Lys 


225 










230 










235 










240 


Ser 


Asn 


Tyr 


Gin 


Ser 
245 


His 


Gly 


He 


Tyr 


Arg 
250 


Asp 


Ala 


Lys 


Thr 


Ala 
255 


Phe 


Cys 


He 


His 


Asn 


He 


Ser 


Tyr 


Gin 


Gly 


Arg 


Phe 


Ala 


Phe 


Ser 


Asp 


Tyr 






260 










265 










270 






Pro 


Glu 


Leu 
275 


Asn 


Leu 


Pro 


Glu 


Arg 
280 


Phe 


Lys 


Ser 


Ser 


Phe 
285 


Asp 


Phe 


He 


Asp 


Gly 
290 


Tyr 


Glu 


Lys 


Pro 


Val 
295 


Glu 


Gly 


Arg 


Lys 


He 
300 


Asn 


Trp 


Met 


Lys 


Ala 


Gly 


He 


Leu 


Glu 


Ala 


Asp 


Arg 


Val 


Leu 


Thr 


Val 


Ser 


Pro 


Tyr 


Tyr 


305 








310 










315 










320 


Ala 


Glu 


Glu 


Leu 


He 
325 


Ser 


Gly 


He 


Ala 


Arg 
330 


Gly 


Cys 


Glu 


Leu 


Asp 
335 


Asn 


lie 


Met 


Arg 


Leu 
340 


Thr 


Gly 


He 


Thr 


Gly 
345 


He 


Val 


Asn 


Gly 


Met 
350 


Asp 


Val 


Ser 


Glu 


Trp 
355 


Asp 


Pro 


Ser 


Arg 


Asp 
360 


Lys 


Tyr 


He 


Ala 


Val 
365 


Lys 


Tyr 


Asp 
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Val 


Ser 
370 


Thr 


Ala 


Val 


Glu 


Ala 
375 


Lys 


Ala 


Leu 


Asn 


Lys 
380 


Glu 


Ala 


Leu 


Gin 


Ala 


Glu 


Val 


Gly 


Leu 


Pro 


Val 


Asp 


Arg 


Asn 


He 


Pro 


Leu 


Val 


Ala 


Phe 


385 










390 










395 










400 


He 


Gly 


Arg 


Leu 


Glu 


Glu 


Gin 


Lys 


Gly 


Pro 


Asp 


Val 


Met 


Ala 


Ala 


Ala 








405 










410 










415 




He 


Pro 


Gin 


Leu 
420 


Met 


Glu 


Met 


Val 


Glu 

425 


Asp 


Val 


Gin 


He 


Val 
430 


Leu 


Leu 


Gly 


Thr 


Gly 
435 


Lys 


Lys 


Lys 


Phe 


Glu 
440 


Arg 


Met 


Leu 


Met 


Ser 
445 


Ala 


Glu 


Glu 


Lys 


Phe 

450 


Pro 


Gly 


Lys 


Val 


Arg 
455 


Ala 


Val 


Val 


Lys 


Phe 
460 


Asn 


Ala 


Ala 


Leu 


Ala 


His 


His 


He 


Met 


Ala 


Gly 


Ala 


Asp 


Val 


Leu 


Ala 


Val 


Thr 


Ser 


Arg 


465 










470 










475 










480 


Phe 


Glu 


Pro 


Cys 


Gly 


Leu 


He 


Gin 


Leu 


Gin 


Gly 


Met 


Arg 


Tyr 


Gly 


Thr 








485 










490 










495 




Pro 


Cys 


Ala 


Cys 
500 


Ala 


Ser 


Thr 


Gly 


Gly 
505 


Leu 


Val 


Asp 


Thr 


lie 
510 


He 


Glu 


Gly 


Lys 


Thr 
515 


Gly 

-* 


Phe 


His 


Met 


Gly 
520 


Arg 


Leu 


Ser 


Val 


Asp 
525 


Cys 


Asn 


Val 


Val 


Glu 
530 


Pro 


Ala 


Asp 


Val 


Lys 
535 


Lys 


Val 


Ala 


Thr 


Thr 
540 


Leu 


Gin 


Arg 


Ala 


He 


Lys 


Val 


Val 


Gly 


Thr 


Pro 


Ala 


Tyr 


Glu 


Glu 


Met 


Val 


Arg 


Asn 


Cys 


545 








550 










555 










560 


Met 


He 


Gin 


Asp 


Leu 
565 


Ser 


Trp 


Lys 


Gly 


Pro 
570 


Ala 


Lys 


Asn 


Trp 


Glu 
575 


Asn 


Val 


Leu 


Leu 


Ser 
580 


Leu 


Gly 


Val 


Ala 


Gly 
585 


Gly 


Glu 


Pro 


Gly 


Val 
590 


Glu 


Gly 


Glu 


Glu 


He 
595 


Ala 


Pro 


Leu 


Ala 


Lys 
600 


Glu 


Asn 


Val 


Ala 


Ala 
605 


Pro 







<210> 9 
<211> 2223 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 9 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg gcatgtccgg gggctactcg 180 
atgggctacg acccgtacga ctacttcgac ctcggcgagt actaccagaa gggcacggtg 24 0 
gagacgcgct tcgggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 3 00 
ggcatcaagg tcatcgcgga catcgtgatc aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg tcggcgacta cacctggacg gacttctcca aggtcgcctc cggcaagtac 420 
accgccaact acctcgactt ccaccccaac gagctgcacg cgggcgactc cggcacgttc 480 
ggcggctacc cggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc atcggcatcg acgcgtggcg cttcgactac 600 
gtcaagggct acggggcctg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggccta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 
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atcccggcgc tcgtcgaggc cctgaagaac ggcggcacgg tggtctcccg cgacccgttc 840 
aaggccgtga ccttcgtcgc caaccacgac acggacatca tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta cgagggccag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 102 0 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct cggcggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtacggcta ctccgtgtgg tcctactgcg gcgtcggcac atcgattgct 1320 
ggcatcctcg aggccgacag ggtcctcacc gtcagcccct actacgccga ggagctcatc 1380 
tccggcatcg ccaggggctg cgagctcgac aacatcatgc gcctcaccgg catcaccggc 1440 
atcgtcaacg gcatggacgt cagcgagtgg gaccccagca gggacaagta catcgccgtg 1500 
aagtacgacg tgtcgacggc cgtggaggcc aaggcgctga acaaggaggc gctgcaggcg 1560 
gaggtcgggc tcccggtgga ccggaacatc ccgctggtgg cgttcatcgg caggctggaa 1620 
gagcagaagg gccccgacgt catggcggcc gccatcccgc agctcatgga gatggtggag 1680 
gacgtgcaga tcgttctgct gggcacgggc aagaagaagt tcgagcgcat gctcatgagc 1740 
gccgaggaga agttcccagg caaggtgcgc gccgtggtca agttcaacgc ggcgctggcg 1800 
caccacatca tggccggcgc cgacgtgctc gccgtcacca gccgcttcga gccctgcggc 1860 
ctcatccagc tgcaggggat gcgatacgga acgccctgcg cctgcgcgtc caccggtgga 1920 
ctcgtcgaca ccatcatcga aggcaagacc gggttccaca tgggccgcct cagcgtcgac 1980 
tgcaacgtcg tggagccggc ggacgtcaag aaggtggcca ccaccttgca gcgcgccatc 2040 
aaggtggtcg gcacgccggc gtacgaggag atggtgagga actgcatgat ccaggatctc 2100 
tcctggaagg gccctgccaa gaactgggag aacgtgctgc tcagcctcgg ggtcgccggc 2160 
ggcgagccag gggttgaagg cgaggagatc gcgccgctcg ccaaggagaa cgtggccgcg 2220 
ccc ~ " ~ 2223 



<210> 10 
<211> 741 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 10 
Met Ala Lys Tyr 
1 

Phe Tyr Trp Asp 

20 

Gin Lys lie Pro 
35 

Pro Pro Ala Ser 
50 

Pro Tyr Asp Tyr 
65 

Glu Thr Arg Phe 

Ala His Ala Tyr 

100 

Arg Ala Gly Gly 
115 

Trp Thr Asp Phe 
130 

Leu Asp Phe His 



Leu Glu Leu Glu 

5 

Val Pro Ser Gly 

Glu Trp Tyr Asp 

40 

Lys Gly Met Ser 
55 

Phe Asp Leu Gly 
70 

Gly Ser Lys Gin 
85 

Gly He Lys Val 

Asp Leu Glu Trp 

120 

Ser Lys Val Ala 
135 

Pro Asn Glu Leu 



Glu Gly Gly Val 
10 

Gly He Trp Trp 
25 

Ala Gly He Ser 

Gly Gly Tyr Ser 

60 

Glu Tyr Tyr Gin 
75 

Glu Leu He Asn 
90 

He Ala Asp He 
105 

Asn Pro Phe Val 

Ser Gly Lys Tyr 

140 

His Ala Gly Asp 
178 



He Met Gin Ala 
15 

Asp Thr He Arg 
30 

Ala He Trp He 
45 

Met Gly Tyr Asp 

Lys Gly Thr Val 

80 

Met He Asn Thr 
95 

Val He Asn His 
110 

Gly Asp Tyr Thr 
125 

Thr Ala Asn Tyr 
Ser Gly Thr Phe 
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145 

Gly Gly Tyr Pro 

Leu Trp Ala Ser 

180 

lie Asp Ala Trp 
195 

Val Lys Asp Trp 
210 

Trp Asp Thr Asn 
225 

Ala Lys Val Phe 

Asp Asn Lys Asn 

260 

Thr Val Val Ser 
275 

His Asp Thr Asp 
290 

Leu Thr Tyr Glu 
305 

Trp Leu Asn Lys 

Leu Ala Gly Gly 

340 

lie Phe Val Arg 
355 

lie Asn Leu Gly 
370 

Phe Ala Gly Ala 
385 

Val Asp Lys Tyr 

Ala Tyr Asp Pro 

420 

Cys Gly Val Gly 
435 

Leu Thr Val Ser 
450 

Arg Gly Cys Glu 
465 

lie Val Asn Gly 

Tyr lie Ala Val 

500 

Leu Asn Lys Glu 
515 

Asn lie Pro Leu 
530 

Pro Asp Val Met 
545 

Asp Val Gin He 
Met Leu Met Ser 



150 

Asp He Cys His 
165 

Gin Glu Ser Tyr 

Arg Phe Asp Tyr 

200 

Leu Asn Trp Trp 
215 

Val Asp Ala Leu 
230 

Asp Phe Pro Leu 
245 

He Pro Ala Leu 

Arg Asp Pro Phe 

280 

He He Trp Asn 
295 

Gly Gin Pro Thr 
310 

Asp Lys Leu Lys 
325 

Ser Thr Ser He 

Asn Gly Tyr Gly 

360 

Ser Ser Lys Val 
375 

Cys He His Glu 
390 

Val Tyr Ser Ser 
405 

Ala Asn Gly Gin 

Thr Ser He Ala 

440 

Pro Tyr Tyr Ala 
455 

Leu Asp Asn He 
470 

Met Asp Val Ser 
485 

Lys Tyr Asp Val 

Ala Leu Gin Ala 

520 

Val Ala Phe He 
535 

Ala Ala Ala He 
550 

Val Leu Leu Gly 
565 

Ala Glu Glu Lys 



155 

Asp Lys Ser Trp 
170 

Ala Ala Tyr Leu 
185 

Val Lys Gly Tyr 

Gly Gly Trp Ala 

220 

Leu Asn Trp Ala 
235 

Tyr Tyr Lys Met 
250 

Val Glu Ala Leu 
265 

Lys Ala Val Thr 

Lys Tyr Pro Ala 

300 

He Phe Tyr Arg 
315 

Asn Leu He Trp 
330 

Val Tyr Tyr Asp 
345 

Ser Lys Pro Gly 

Gly Arg Trp Val 

380 

Tyr Thr Gly Asn 
395 

Gly Trp Val Tyr 
410 

Tyr Gly Tyr Ser 
425 

Gly He Leu Glu 

Glu Glu Leu He 

460 

Met Arg Leu Thr 
475 

Glu Trp Asp Pro 
490 

Ser Thr Ala Val 
505 

Glu Val Gly Leu 

Gly Arg Leu Glu 

540 

Pro Gin Leu Met 
555 

Thr Gly Lys Lys 
570 

Phe Pro Gly Lys 
179 









160 


ASp 


bin 


Tyr 


Trp 






175 




Arg 


Ser 


He 


Gly 




190 






Gly 


Ala 


Trp 


Val 


205 








Val 


Gly 


Glu 


Tyr 


Tyr 


Ser 


Ser 


Gly 








240 


Asp 


Ala 


Ala 


Phe 






255 




Lys 


Asn 


Gly 


Gly 




270 






Phe 


Val 


Ala 


Asn 


285 








Tyr 


Ala 


Phe 


He 


Asp 


Tyr 


Glu 


Glu 








320 


lie 


Hi S 


Asp 


Asn 






335 




Ser 


Asp 


Glu 


Met 




350 






Leu 


He 


Thr 


Tyr 


365 








Tyr 


Val 


Pro 


Lys 


Leu 


Gly 


Gly 


Trp 








400 


Leu 


Glu 


Ala 


Pro 






415 




Val 


Trp 


Ser 


Tyr 




430 






Ala 


Asp 


Arg 


Val 


445 








Ser 


Gly 


He 


Ala 


Gly 


He 


Thr 


Gly 








480 


ser 


Arg 


ASp 


j-iys 






495 




Glu 


Ala 


Lys 


Ala 




510 






Pro 


Val 


Asp 


Arg 


525 








Glu 


Gin 


Lys 


Gly 


Glu 


Met 


Val 


Glu 








560 


Lys 


Phe 


Glu 


Arg 






575 




Val 


Arg 


Ala 


Val 
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580 585 590 

Val Lys Phe Asn Ala Ala Leu Ala His His lie Met Ala Gly Ala Asp 

595 600 605 

Val Leu Ala Val Thr Ser Arg Phe Glu Pro Cys Gly Leu lie Gin Leu 

610 615 620 

Gin Gly Met Arg Tyr Gly Thr Pro Cys Ala Cys Ala Ser Thr Gly Gly 
625 630 635 640 

Leu Val Asp Thr lie lie Glu Gly Lys Thr Gly Phe His Met Gly Arg 

645 650 655 

Leu Ser Val Asp Cys Asn Val Val Glu Pro Ala Asp Val Lys Lys Val 

660 665 670 

Ala Thr Thr Leu Gin Arg Ala lie Lys Val Val Gly Thr Pro Ala Tyr 

675 680 685 

Glu Glu Met Val Arg Asn Cys Met lie Gin Asp Leu Ser Trp Lys Gly 

690 695 700 

Pro Ala Lys Asn Trp Glu Asn Val Leu Leu Ser Leu Gly Val Ala Gly 
705 710 715 720 

Gly Glu Pro Gly Val Glu Gly Glu Glu lie Ala Pro Leu Ala Lys Glu 

725 730 735 

Asn Val Ala Ala Pro 

740 



<210> 11 
<211> 1515 
<212> DNA 
<213> Zea mays 

<400> 11 

ggagagctat gagacgtatg tcctcaaagc cactttgcat tgtgtgaaac caatatcgat 60 
ctttgttact tcatcatgca tgaacatttg tggaaactac tagcttacaa gcattagtga 120 
cagctcagaa aaaagttatc tatgaaaggt ttcatgtgta ccgtgggaaa tgagaaatgt 180 
tgccaactca aacaccttca atatgttgtt tgcaggcaaa ctcttctgga agaaaggtgt 240 
ctaaaactat gaacgggtta cagaaaggta taaaccacgg ctgtgcattt tggaagtatc 300 
atctatagat gtctgttgag gggaaagccg tacgccaacg ttatttactc agaaacagct 360 
tcaacacaca gttgtctgct ttatgatggc atctccaccc aggcacccac catcacctat 420 
ctctcgtgcc tgtttatttt cttgcccttt ctgatcataa aaaaacatta agagtttgca 480 
aacatgcata ggcatatcaa tatgctcatt tattaatttg ctagcagatc atcttcctac 540 
tctttacttt atttattgtt tgaaaaatat gtcctgcacc tagggagctc gtatacagta 600 
ccaatgcatc ttcattaaat gtgaatttca gaaaggaagt aggaacctat gagagtattt 660 
ttcaaaatta attagcggct tctattatgt ttatagcaaa ggccaagggc aaaattggaa 720 
cactaatgat ggttggttgc atgagtctgt cgattacttg caagaaatgt gaacctttgt 780 
ttctgtgcgt gggcataaaa caaacagctt ctagcctctt ttacggtact tgcacttgca 840 
agaaatgtga actccttttc atttctgtat gtggacataa tgccaaagca tccaggcttt 900 
ttcatggttg ttgatgtctt tacacagttc atctccacca gtatgccctc ctcatactct 960 
atataaacac atcaacagca tcgcaattag ccacaagatc acttcgggag gcaagtgcga 1020 
tttcgatctc gcagccacct ttttttgttc tgttgtaagt ataccttccc ttaccatctt 1080 
tatctgttag tttaatttgt aattgggaag tattagtgga aagaggatga gatgctatca 1140 
tctatgtact ctgcaaatgc atctgacgtt atatgggctg cttcatataa tttgaattgc 1200 
tccattcttg ccgacaatat attgcaaggt atatgcctag ttccatcaaa agttctgttt 1260 
tttcattcta aaagcatttt agtggcacac aatttttgtc catgagggaa aggaaatctg 1320 
ttttggttac tttgcttgag gtgcattctt catatgtcca gttttatgga agtaataaac 1380 
ttcagtttgg tcataagatg tcatattaaa gggcaaacat atattcaatg ttcaattcat 1440 
cgtaaatgtt ccctttttgt aaaagattgc atactcattt atttgagttg caggtgtatc 1500 
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tagtagttgg aggag 1515 

<210> 12 
<211> 673 
<212> DNA 
<213> Zea mays 

<400> 12 

gatcatccag gtgcaaccgt ataagtccta aagtggtgag gaacacgaaa caaccatgca 60 
ttggcatgta aagctccaag aatttgttgt atccttaaca actcacagaa catcaaccaa 120 
aattgcacgt caagggtatt gggtaagaaa caatcaaaca aatcctctct gtgtgcaaag 180 
aaacacggtg agtcatgccg agatcatact catctgatat acatgcttac agctcacaag 240 
acattacaaa caactcatat tgcattacaa agatcgtttc atgaaaaata aaataggccg 300 
gacaggacaa aaatccttga cgtgtaaagt aaatttacaa caaaaaaaaa gccac atgtc 360 
aagctaaatc taattcgttt tacgtagatc aacaacctgt agaaggcaac aaaactgagc 420 
cacgcagaag tacagaatga ttccagatga accatcgacg tgctacgtaa agagagtgac 480 
gagtcatata catttggcaa gaaaccatga agctgcctac agccgtctcg gtggcataag 540 
aacacaagaa attgtgttaa ttaatcaaag ctataaataa cgctcgcatg cctgtgcact 600 
tctccatcac caccactggg tcttcagacc attagcttta tctactccag agcgcagaag 660 
aacccgatcg aca 673 

<210> 13 
<211> 454 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 13 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val lie Met 

20 25 30 

Gin Ala Phe Tyr Trp Asp Val Pro Ser Gly Gly He Trp Trp Asp Thr 

35 40 45 

He Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He 

50 55 60 

Trp He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly 
65 70 75 80 

Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr Tyr Gin Lys Gly 

85 90 95 

Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu He Asn Met He 

100 105 110 

Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp He Val He 

115 120 125 

Asn His Arg Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Gly Asp 

130 135 140 

Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala 
145 150 155 160 

Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly 

165 170 175 

Thr Phe Gly Gly Tyr Pro Asp He Cys His Asp Lys Ser Trp Asp Gin 

180 185 190 
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TVr 

X Jr Mm 


Trn 


Leu 

XJ w V_«4 


Trn 


Ala 


Ser 


Gin 


Glu 






195 










200 


lie 

■X> J- 


Glv 


lie 


Asp 


Ala 


Trp 


Arg 


Phe 




210 










215 






Val 


Val 


Lvs 


Asp 


Trp 


Leu 


Asn 


225 










230 






Glu 


Tvr 


Tro 


Asp 


Thr 


Asn 


Val 


Asp 










245 








Ser 


Glv 


Ala 


Lvs 


Val 


Phe 


Asp 


Phe 








260 










Ala 


Phe 


ASD 


Asn 


Lvs 


Asn 


lie 


Pro 






275 










280 


Glv 


Glv 


Thr 


Val 


Val 


Ser 


Arg 


Asp 




290 










295 




Ala 


Asn 


His 


ASD 


Thr 


Asp 


He 


He 


3 0 5 










310 






rue 


lie 


Leu 

X_l V_#< \#A 


Thr 

miW X X X» 


Tvr 


Glu 


Gly 


Gin 










325 








Glu 


Glu 




Leu 


Asn 


Lvs 


Asp 


Lys 

ml 








340 














Leu 


Ala 


Glv 


Glv 


Ser 


Thr 






355 










360 


Glu 

\JJ X 1-1 


Met 


lie 


Phe 


Val 


Arq 


Asn 


Gly 




37 0 










375 




Th t" 


Tvr" 

x y j« 


lie 

jj ■ 


Asn 


Leu 


Gly 


Ser 


Ser 


385 










390 






Pro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


He 










405 








Gly 


Trp 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 








420 










Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 






435 










440 


Ser 


Tyr 


Cys 


Gly 


val 


Gly 







450 



Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 










205 








Asp 

tr 


Tyr 

ml 


Val 


Lys 


Gly 


Tyr 


Gly 


Ala 








220 










Trp 

IT 


Trp 


Gly 


Gly 


Trp 

x> 


Ala 


Val 


Gly 






235 










240 


Ala 


Leu 


Leu 


Asn 


Trp 

*• 


Ala 


Tyr 


Ser 




250 










255 




Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 

x> 


Ala 


265 










270 






Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 










285 








Pro 


Phe 


Lys 


Ala 


Val 


Thr 


Phe 


Val 








300 










Trp 


Asn 


Lys 

ml 


Tyr 


Pro 


Ala 


Tyr 


Ala 






315 










320 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 




330 










335 




Leu 


Lvs 


Asn 


Leu 


lie 


Trp 


He 


His 


345 










350 






Ser 


He 


Val 


Tyr 


Tyr 


Asp 

XT 


Ser 


Asp 

x> 










365 








TVr 

ml 


Gly 


Ser 


Lys 


Pro 


Gly 


Leu 


He 








380 










Lys 


Val 


Gly 


Arg 


Trp 


val 


Tyr 


Val 






395 










400 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 




410 










415 




Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


425 










430 






Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 



445 



<210> 14 

<211> 460 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 14 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


Thr 


Ser 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 


Val 


He 


Met 








20 










25 










30 






Gin 


Ala 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 


Asp 


Thr 






35 










40 










45 








He 


Arg 


Gin 


Lys 


He 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


He 


Ser 


Ala 


He 




50 










55 










60 










Trp 


He 


Pro 


Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 


Gly 


Tyr 


Ser 


Met 


Gly 



182 



WO 2005/096804 



PCT/US2004/007182 



TVr 
x y l 




XT J. \J 


Tvr 

x y x 


Thr 


Val 


Glu 


Thr 








100 


noil 


ins. 


A 1 -a 


n x o 






1 1 C 
113 




R on 


Hi <=• 


A TCI 
nx y 


Ala 




ljU 






T\ /■ y- 
lyr 


Th r 




Th t* 


14!) 








Hen 


iyr 




flcn 

O y-J 


Thr 


Phe 


Gly 


Gly 








180 


lyr 


± x p 


Lie Li 


T r~TO 






i y b 




lie 


<jiy 


Tip 

lie 


A o to 
M.o£J 




OTA 






i rp 


V d _L 


VAX 


T .\/ o 
XJ y O 


IOC 








pi n 

IjlU 


iyr 


T rr\ 
lip 


A e io 
ASp 


Ser 


Gly 


Ala 


Lys 








260 


Ala 


rile 


A C? tO 

Hbp 


nbll 






Z / D 






ui y 


Th r 


V C* X 




O Q /O 

z y u 






Aid 


7V c - ^ t"-\ 


nl o 


A a to 
ASp 


^ A C 








Ph a 

trie 


Tin 
lie 


T ,pi l 
JJC LI 


X 11X 


Glu 


Glu 


Trp 


Leu 








340 


nop 


A c; n 


T iPi i 

J_J 53 l_* 


Ala 






Jj D D 




cj 1 it 




X J. C 


Phe 










Th y 


Tvr 

i y x 


lie 


Asn 


^ 

385 








Pro 


Lys 


Phe 


Ala 


Gly 


Trp 


Val 


Asp 








420 


Ala 


Pro 


Ala 


Tyr 






435 




Ser 


Tyr 


Cys 


Gly 




450 









70 






A c 'O 


Tvr 
i y x 


Phe 




85 








A TCI 


Phe 


Glv 


Ser 


Ala 


Tvr 


Glv 


lie 








120 


Gly 


Gly 


Asp 


Leu 






13 5 

X —J — ' 




A a to 
nop 


php 


Cp-r 
OCX 


xi j o 




150 






Ph <=» 

tr 11G 


Hi Q 
n x o 


Prn 


f^O 11 


165 

-i_ W w> 








iyr 


lr 1 U 


A en 
oisp 


Tie 

X X c 


A 1 a 


O C X 


m n 

\J X 11 


\J X u 








200 


Ala 


Trp 


Arg 


Phe 






215 




A c* to 


i rp 


T .pi i 

XJ~ \X 


A q n 

/OO 11 




2 3 0 
-j <j 






1 11 X 


A c n 


\J -3. 1 

vai 


A c? to 


24 5 








Val 


ph*a 
xr 11 1: 


A c to 


Phe 

ir lie 


by o 


A c t-i 

/AO 1 1 


lie 

X X ^ 


Pro 








280 


Val 


Ser 


Arg 


Asp 






295 




Th f~ 
1 111 


A a r\ 
nop 


Tie 

X X c 


Tie 




310 

*j JL v 






T\/ V 

iyr 


ul U 


m v 

VJ X y 


Gl n 

VJ X 1 1 


32 5 








Aqn 


T i V Q 
li y o 


A c; lO 


TiVS 

xj y o 


Glv 

w X y 


Gl v 
vjx y 


Ser 


Thr 








360 


Val 


Arg 


Asn 


Gly 






375 




T.ai t 
XI C 


m v 

o x y 


Q©r 


Ser 




3 90 






Gly 


Aia 


cys 


Tic 

lie 


405 








Lys 


Tyr 


Val 


Tyr 


Asp 


Pro 


Ala 


Asn 








440 


Val 


Gly 


Ser 


Glu 






455 









75 




Tien 


Glv 

\jx y 


Glu 


Tvr 




90 






Lvs 


Gin 


Glu 


Leu 


105 








Lvs 


Val 


lie 


Ala 


Glu 


Trr> 


Asn 


Pro 








140 


Val 


Ala 


Ser 


Gly 






155 




Glu 


i—i v — ■ d. 


His 

X 1 X ' 


Ala 




170 








His 


Asd 


Lvs 


18 5 

_1_ \J *-S 








Ser 

* — ' V- 1- 


Tvr 

x y x 


Ala 

A * -X. Vf-*. 


Ala 


ZOO£rf 


Tvr 

x y x 


Val 

V i_^X ^L. 


Lvs 








220 


Trp 


Trp 


Gly 


Gly 






23 5 

mJ 




Ala 

Al C* 


Leu 

XJ w CA 


Leu 

±J X^* 


Asn 




250 






Pro 

Jr x w 


T.pn 

JLJ w LJjI 


Tvr 


Tvr 

x y x 


2 65 

^ V-/ w' 








Ala 


Leu 


Val 


Glu 

Bpk >p*1* 


Pro 


Phe 


Lys 


Ala 








300 


Trp 


Asn 


Lys 


Tyr 






315 




Pro 


Thr 

X 1 1 -x- 


lie 


Phe 




330 






Leu 

U V— 


Lvs 

xjy o 


Asn 


Leu 


34 5 








Ser 


lie 


Val 


Tvr 


Tvr 


Gly 


Ser 


Lvs 








380 


Lys 


Val 


Gly 


Arg 






395 




His 


Glu 


Tyr 


Thr 




410 






Ser 


Ser 


Gly 


Trp 


425 








Gly 


Gin 


Tyr 


Gly 


Lys 


ASp 


Glu 


Leu 



460 



Tyr 


Gin 


Lys 


Gly 






95 




lie 


Asn 


Met 


He 




110 






Asp 


He 


Val 


He 


125 








Phe 


Val 


Gly 


Asp 


Lys 


Tyr 


Thr 


Ala 








160 


Gly 


Asp 


Ser 


Gly 






175 




Ser 


Trp 


Asp 


Gin 




190 






Tyr 


Leu 


Arg 


Ser 


205 








Gly 


Tyr 


Gly 


Ala 


Trp 


Ala 


Val 


Gly 








240 


Trp 


Ala 


Tyr 


Ser 






255 




Lys 


Met 


Asp 

•Mm 


Ala 




270 






Ala 


Leu 


Lys 


Asn 


285 








Val 


Thr 


Phe 


Val 


Pro 


Ala 


Tyr 


Ala 








320 


Tyr 


Arg 


Asp 


Tyr 






335 




He 


Trp 


He 


His 




350 






Tyr 


Asp 

X 


Ser 


Asp 

mm 


365 








Pro 


Gly 


Leu 


He 


Trp 


Val 


Tyr 


Val 








400 


Gly 


Asn 


Leu 


Gly 






415 




Val 


Tyr 


Leu 


Glu 




430 






Tyr 


Ser 


Val 


Trp 



<210> 15 
<211> 518 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 15 



Met 


Leu 


Ala 


Ala 


Leu 


Ala 


Thr 


Ser 


Gin 


Leu 


Val 


Ala 


Thr 


Arg 


Ala 


Gly 


1 








5 










10 










15 




Leu 


Glv 


Val 


Pro 


Asp 


Ala 


Ser 


Thr 


Phe 


Arg 


Arg 


Gly 


Ala 


Ala 


Gin 


Gly 






20 










25 










30 








Ara 


Glv 
35 


Ala 


Arg 


Ala 


Ser 


Ala 
40 


Ala 


Ala 


Asp 


Thr 


Leu 
45 


Ser 


Met 


Arg 


Thr 


Ser 
50 


Ala 


Ara 


Ala 


Ala 


Pro 
55 


Arg 


His 


Gin 


His 


Gin 
60 


Gin 


Ala 


Arg 


Arg 




Ala 


Ara 


Phe 


Pro 


Ser 


Leu 


Val 


Val 


Cys 


Ala 


Ser 


Ala 


Gly 


Ala 


Met 


65 








70 










75 










80 


Ala 


TiV<3 


Tvr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 


Val 


He 


Met 


Gin 


Ala 


Phe 






85 










90 










95 




Tvr 

i. y J. 


Trt> 


Asd 


Val 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 

JU 


Asp 


Thr 


He 


Arg 


Gin 






100 










105 










110 








lie 

J- -L. Nw 


Pro 


Glu 


TrD 


Tvr 


Asp 


Ala 


Gly 


He 


Ser 


Ala 


He 


Trp 


He 


Pro 




115 

-1- -A- 










120 










125 








Pro 


Ala 
130 


Ser 


LVS 


Gly 


Met 


Ser 
135 


Gly 


Gly 


Tyr 


Ser 


Met 
140 


Gly 


Tyr 


Asp 


Pro 


Tvt 
x y .l 


Asd 


Tvr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


Glu 


145 










150 










155 










160 


X 1 1 JL 


Ara 


Phe 


Glv 


Ser 


Lys 


Gin 


Glu 


Leu 


He 


Asn 


Met 


He 


Asn 


Thr 


Ala 








165 










170 










175 




His 


Ala 


Tvr 


Gly 

180 


He 


Lys 


Val 


He 


Ala 
185 


Asp 


He 


Val 


He 


Asn 
190 


His 


Arg 


Ala 


Gly 


Gly 

195 


Asp 


Leu 


Glu 


Trp 


Asn 
200 


Pro 


Phe 


Val 


Gly 


Asp 
205 


Tyr 


Thr 


Trp 


Thr 


Asp 
210 


Phe 


Ser 


Lys 


Val 


Ala 
215 


Ser 


Gly 


Lys 


Tyr 


Thr 
220 


Ala 


Asn 


Tyr 


Leu 


Asd 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


Gly 


225 










230 










235 










240 


Gly 


Tyr 


Pro 


Asp 


He 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 


Leu 






245 










250 










255 




Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


He 


Gly 


He 






260 










265 










270 






Asp 


Ala 


Trp 
275 


Arg 


Phe 


Asp 


Tyr 


Val 
280 


Lys 


Gly 


Tyr 


Gly 


Ala 
285 


Trp 


Val 


Val 


Lys 


Asp 
290 


Trp 


Leu 


Asn 


Trp 


Trp 
295 


Gly 


Gly 


Trp 


Ala 


Val 
300 


Gly 


Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


Ala 


305 










310 










315 










320 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 


Asp 








325 










330 










335 




Asn 


Lys 


Asn 


He 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 


Gly 


Gly 


Thr 






340 










345 










350 






Val 


Val 


Ser 
355 


Arg 


Asp 


Pro 


Phe 


Lys 
360 


Ala 


Val 


Thr 


Phe 


Val 
365 


Ala 


Asn 


His 


Asp 


Thr 

370 


Asp 


He 


He 


Trp 


Asn 
375 


Lys 


Tyr 


Pro 


Ala 


Tyr 
380 


Ala 


Phe 


He 


Leu 
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Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 


Glu 


Glu 


Trp 


385 








390 










395 










400 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 


Leu 


He 


Trp 


He 


His 


Asp 


Asn 


Leu 








405 










410 










415 




Ala 


Gly 


Gly 


Ser 


Thr 


Ser 


He 


Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 


Glu 


Met 


He 






420 










425 










430 






Phe 


Val 


Arg 


Asn 


Gly 


Tyr 


Gly 


Ser 


Lys 


Pro 


Gly 


Leu 


He 


Thr 


Tyr 


He 






435 










440 










445 








Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 


Arg 


Trp 


Val 


Tyr 


Val 


Pro 


Lys 


Phe 




450 










455 










460 










Ala 


Gly 


Ala 


Cys 


lie 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 


Gly 


Trp 


Val 


465 








470 










475 










480 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 


Ala 






485 










490 










495 




Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


Cys 




500 










505 










510 






Gly 


Val 


Gly 


Thr 


Ser 


He 























515 



<210> 16 

<211> 820 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 16 



Met 


Leu 


Ala 


Ala 


Leu 


Ala 


Thr 


Ser 


1 








5 








Leu 


Gly 


Val 


Pro 


Asp 


Ala 


Ser 


Thr 








20 










Leu 


Arg 


Gly 


Ala 


Arg 


Ala 


Ser 


Ala 






35 










40 


Thr 


Ser 


Ala 


Arg 


Ala 


Ala 


Pro 


Arg 




50 










55 




Gly Ala 


Arg 


Phe 


Pro 


Ser 


Leu 


Val 


65 










70 






Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 










85 








Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 








100 










Lys 


He 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 






115 










120 


Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 




130 










135 




Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


145 










150 






Thr 


Arg 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 










165 








His 


Ala 


Tyr 


Gly 


He 


Lys 


Val 


He 








180 










Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 



Gin 


Leu 


Val 


Ala 


Thr 


Arg 


Ala 


Gly 




10 










15 




Phe 


Arg 


Arg 


Gly 


Ala 


Ala 


Gin 


Gly 


25 










30 






Ala 


Ala 


Asp 


Thr 


Leu 


Ser 


Met 


Arg 










45 








His 


Gin 


His 


Gin 


Gin 


Ala 


Arg 


Arg 








60 










Val 


Cys 


Ala 


Ser 


Ala 


Gly 


Ala 


Met 






75 










80 


Gly 


Gly 


Val 


He 


Met 


Gin 


Ala 


Phe 




90 










95 




He 


Trp 


Trp 


Asp 


Thr 


He 


Arg 


Gin 


105 










110 






Gly 


He 


Ser 


Ala 


He 


Trp 


He 


Pro 










125 








Gly 


Tyr 


Ser 


Met 


Gly 


Tyr 


Asp 


Pro 








140 










Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


Glu 






155 










160 


Leu 


He 


Asn 


Met 


He 


Asn 


Thr 


Ala 




170 










175 




Ala 


Asp 


He 


Val 


He 


Asn 


His 


Arg 


185 










190 






Pro 


Phe 


val 


Gly 


Asp 


Tyr 


Thr 


Trp 



185 
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195 










200 










205 








Thr 


Asp 
210 


Phe 


Ser 


Lys 


Val 


Ala 
215 


Ser 


Gly 


Lys 


Tyr 


Thr 
220 


Ala 


Asn 


Tyr 


Leu 


Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


Gly 


225 










230 










235 










240 


Gly 


Tyr 


Pro 


Asp 


He 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 


Leu 






245 










250 










255 




Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


He 


Gly 


He 






260 










265 










270 






Asd 


Ala 


Trp 

tr 

275 


Arg 


Phe 


Asp 


Tyr 


Val 
280 


Lys 


Gly 


Tyr 


Gly 


Ala 
285 


Trp 


Val 


Val 


Lvs 


Asd 
290 


Trp 

tr 


Leu 


Asn 


Trp 

tr 


Trp 

295 


Gly 


Gly 


Trp 


Ala 


Val 
300 


Gly 


Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


Ala 


305 










310 










315 










320 


Lvs 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 


Asp 








325 










330 










335 




Asn 


Lvs 


Asn 


He 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 


Gly 


Gly 


Thr 






340 










345 










350 






Val 


Val 


Ser 
355 


Arg 


Asp 


Pro 


Phe 


Lys 
360 


Ala 


Val 


Thr 


Phe 


Val 
365 


Ala 


Asn 


His 


Asd 


Thr 
370 


Asp 


He 


He 


Trp 


Asn 
375 


Lys 


Tyr 


Pro 


Ala 


Tyr 
380 


Ala 


Phe 


He 


Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 


Glu 


Glu 


Trp 


385 






390 










395 










400 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 


Leu 


He 


Trp 


He 


His 


Asp 


Asn 


Leu 








405 










410 










415 




Ala 


Gly 


Gly 


Ser 


Thr 


Ser 


He 


Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 


Glu 


Met 


He 




420 










425 










430 






Phe 


Val 


Arg 
435 


Asn 


Gly 


Tyr 


Gly 


Ser 
440 


Lys 


Pro 


Gly 


Leu 


He 
445 


Thr 


Tyr 


He 


Asn 


Leu 
450 


Gly 


Ser 


Ser 


Lys 


val 

455 


Gly 


Arg 


Trp 


Val 


Tyr 
460 


Val 


Pro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


He 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 


Gly 


Trp 


Val 


465 








470 










475 










480 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 


Ala 








485 










490 










495 




Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


Cys 






500 










505 










510 






Gly 


Val 


Gly 


Thr 


Ser 


He 


Ala 


Gly 


He 


Leu 


Glu 


Ala 


Asp 


Arg 


Val 


Leu 




515 










520 










525 








Thr 


Val 
530 


Ser 


Pro 


Tyr 


Tyr 


Ala 
535 


Glu 


Glu 


Leu 


He 


Ser 
540 


Gly 


He 


Ala 


Arg 


Gly 


Cys 


Glu 


Leu 


Asp 


Asn 


He 


Met 


Arg 


Leu 


Thr 


Gly 


He 


Thr 


Gly 


He 


545 








550 










555 










560 


Val 


Asn 


Gly 


Met 


Asp 
565 


Val 


Ser 


Glu 


Trp 


Asp 
570 


Pro 


Ser 


Arg 


Asp 


Lys 
575 


Tyr 


lie 


Ala 


Val 


Lys 
580 


Tyr 


Asp 


Val 


Ser 


Thr 
585 


Ala 


Val 


Glu 


Ala 


Lys 
590 


Ala 


Leu 


Asn 


Lys 


Glu 


Ala 


Leu 


Gin 


Ala 


Glu 


Val 


Gly 


Leu 


Pro 


Val 


Asp 


Arg 


Asn 




595 










600 










605 








He 


Pro 
610 


Leu 


Val 


Ala 


Phe 


He 
615 


Gly 


Arg 


Leu 


Glu 


Glu 
620 


Gin 


Lys 


Gly 


Pro 


Asp 


Val 


Met 


Ala 


Ala 


Ala 


He 


Pro 


Gin 


Leu 


Met 


Glu 


Met 


Val 


Glu 


Asp 



186 
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625 










630 










635 










640 


Val 


Gin 


lie 


Val 


Leu 


Leu 


Gly 


Thr 


Gly 


Lys 


Lys 


Lys 


Phe 


Glu 


Arg 


Met 










645 










650 










655 




Leu 


Met 


Ser 


Ala 


Glu 


Glu 


Lys 


Phe 


Pro 


Gly 


Lys 


Val 


Arg 


Ala 


Val 


Val 








660 










665 










670 






Lvs 


Phe 


Asn 


Ala 


Ala 


Leu 


Ala 


His 


His 


He 


Met 


Ala 


Gly 


Ala 


Asp 


Val 




675 










680 










685 








Leu 


Ala 


Val 


Thr 


Ser 


Arg 


Phe 


Glu 


Pro 


Cys 


Gly 


Leu 


He 


Gin 


Leu 


Gin 




690 










695 










700 










Gly 


Met 


Arg 


Tyr 


Gly 


Thr 


Pro 


Cys 


Ala 


Cys 


Ala 


Ser 


Thr 


Gly 


Gly 


Leu 


705 










710 










715 










720 


Val 


Asp 


Thr 


He 


He 


Glu 


Gly 


Lys 


Thr 


Gly 


Phe 


His 


Met 


Gly 


Arg 


Leu 








725 










730 










735 




Ser 


Val 


Asp 


Cys 


Asn 


Val 


Val 


Glu 


Pro 


Ala 


Asp 


Val 


Lys 


Lys 


Val 


Ala 






740 










745 










750 






Thr 


Thr 


Leu 


Gin 


Arg 


Ala 


He 


Lys 


Val 


Val 


Gly 


Thr 


Pro 


Ala 


Tyr 


Glu 






755 










760 










765 








Glu 


Met 


Val 


Arg 


Asn 


Cys 


Met 


He 


Gin 


Asp 


Leu 


Ser 


Trp 


Lys 


Gly 


Pro 




770 








775 










780 










Ala 


Lys 


Asn 


Trp 


Glu 


Asn 


Val 


Leu 


Leu 


Ser 


Leu 


Gly 


Val 


Ala 


Gly 


Gly 


785 






790 










795 










800 


Glu 


Pro 


Gly 


Val 


Glu 


Gly 


Glu 


Glu 


He 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu 


Asn 



805 810 815 

Val Ala Ala Pro 

820 



<210> 17 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 17 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser 



<210> 18 
<211> 444 
<212> PRT 

<213> Thermotoga maritima 
<400> 18 

Met Ala Glu Phe Phe Pro Glu He Pro Lys lie Gin Phe Glu Gly Lys 

X 5 10 15 

Glu Ser Thr Asn Pro Leu Ala Phe Arg Phe Tyr Asp Pro Asn Glu Val 

20 25 30 

lie Asp Gly Lys Pro Leu Lys Asp His Leu Lys Phe Ser Val Ala Phe 
35 40 45 
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Trp 

* 


His 


Thr 


Phe 


Val 


Asn 


Glu 


Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Pro 


Thr 


50 










55 










60 










Ala 


Glu 


Arg 


Pro 


Trp 


Asn 


Arg 


Phe 


Ser 


Asp 


Pro 


Met 


Asp 


Lys 


Ala 


Phe 


65 








70 










75 










80 


Ala 


Arg 


Val 


Asp 


Ala 


Leu 


Phe 


Glu 


Phe 


Cys 


Glu 


Lys 


Leu 


Asn 


lie 


Glu 








85 










90 










95 




Tyr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 


Asp 


He 


Ala 


Pro 


Glu 


Gly 


Lys 


Thr 


Leu 




100 










105 










110 






Arg 


Glu 


Thr 


Asn 


Lys 


He 


Leu 


Asp 


Lys 


Val 


Val 


Glu 


Arg 


He 


Lys 


Glu 




115 










120 










125 








Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


Ala 


Asn 


Leu 


130 










135 










140 










Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 


His 


Gly 


Ala 


Ala 


Thr 


Thr 


Cys 


Ser 


Ala 


145 










150 










155 










160 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 


Glu 


He 








165 










170 










175 




Thr 


Lys 


Glu 


Leu 


Gly 


Gly 


Glu 


Gly 


Tyr 


Val 


Phe 


Trp 


Gly 


Gly 


Arg 


Glu 






180 










185 










190 






Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Leu 


Glu 


Leu 


Glu 


Asn 


195 










200 










205 








Leu 


Ala 
210 


Arg 


Phe 


Leu 


Arg 


Met 

215 


Ala 


Val 


Glu 


Tyr 


Ala 

220 


Lys 


Lys 


He 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 


Thr 


Lys 


225 








230 










235 










240 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 


Lys 


Asn 








245 










250 










255 




His 


Gly 

•* 


Leu 


Asp 


Glu 


Tyr 


Phe 


Lys 


Phe 


Asn 


He 


Glu 


Ala 


Asn 


His 


Ala 






260 










265 










270 






Thr 


Leu 


Ala 
275 


Gly 


His 


Thr 


Phe 


Gin 
280 


His 


Glu 


Leu 


Arg 


Met 
285 


Ala 


Arg 


He 


Leu 


Gly 
290 


Lys 


Leu 


Gly 


Ser 


He 
295 


Asp 


Ala 


Asn 


Gin 


Gly 
300 


Asp 


Leu 


Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


Thr 


Asn 


He 


Tyr 


Asp 


Thr 


Thr 


Leu 


305 










310 










315 










320 


Ala 


Met 


Tyr 


Glu 


Val 


He 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 


Gly 


Leu 








325 










330 










335 




Asn 


Phe 


Asp 


Ala 
340 


Lys 


Val 


Arg 


Arg 


Ala 
345 


Ser 


Tyr 


Lys 


Val 


Glu 
350 


Asp 


Leu 


Phe 


He 


Gly 
355 


His 


He 


Ala 


Gly 


Met 
360 


Asp 


Thr 


Phe 


Ala 


Leu 
365 


Gly 


Phe 


Lys 


He 


Ala 
370 


Tyr 


Lys 


Leu 


Ala 


Lys 
375 


Asp 


Gly 


Val 


Phe 


Asp 
380 


Lys 


Phe 


He 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Lys 


Glu 


Gly 


He 


Gly 


Lys 


Glu 


He 


Val 


Glu 


385 








390 










395 










400 


Gly 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 


Lys 


Glu 






405 










410 










415 




Asp 


He 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


Leu 






420 










425 










430 






Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 


He 


Ala 


Glu 


Leu 


Arg 











435 440 



<210> 19 
<211> 1335 
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<212> DNA 

<213> Thermotoga maritima 



<400> 19 

atggccgagt tcttcccgga gatcccgaag atccagttcg agggcaagga gtccaccaac 60 
ccgctcgcct tccgcttcta cgacccgaac gaggtgatcg acggcaagcc gctcaaggac 12 0 
cacctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 180 
ggcgacccga ccgccgagcg cccgtggaac cgcttctccg acccgatgga caaggccttc 240 
gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 
cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 
aaggtggtgg agcgcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 420 
accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 
gacgtgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 540 
ggcggcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 
gacctcggcc tggagctgga gaacctcgcc cgcttcctcc gcatggccgt ggagtacgcc 660 
aagaagatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 720 
caccagtacg acttcgacgt ggccaccgcc tacgccttcc tcaagaacca cggcctcgac 780 
gagtacttca agttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 
cacgagctgc gcatggcccg catcctcggc aagctcggct ccatcgacgc caaccagggc 900 
gacctcctcc tcggctggga caccgaccag ttcccgacca acatctacga caccaccctc 960 
gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 
aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 
gacaccttcg ccctcggctt caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac 1140 
aagttcatcg aggagaagta ccgctccttc aaggagggca tcggcaagga gatcgtggag 1200 
ggcaagaccg acttcgagaa gctggaggag tacatcatcg acaaggagga catcgagctg 1260 
ccgtccggca agcaggagta cctggagtcc ctcctcaact cctacatcgt gaagaccatc 1320 
gccgagctgc gctga 1335 



<210> 20 

<211> 444 

<212> PRT 

<213> Thermotoga neapolitana 



<400> 20 




Met 


Ala 


Glu 


Phe 


1 








Glu 


Ser 


Thr 


Asn 








20 


He 


Asp 


Gly 


Lys 






35 




Trp 


His 


Thr 


Phe 




50 






Ala 


Asp 


Arg 


Pro 


65 








Ala 


Arg 


Val 


Asp 


Tyr 


Phe 


Cys 


Phe 








100 


Arg 


Glu 


Thr 


Asn 






115 




Arg 


Met 


Lys 


Asp 




130 






Phe 


Ser 


His 


Pro 


145 









Phe Pro Glu He 
5 

Pro Leu Ala Phe 

Pro Leu Lys Asp 

40 

Val Asn Glu Gly 
55 

Trp Asn Arg Tyr 
70 

Ala Leu Phe Glu 
85 

His Asp Arg Asp 

Lys He Leu Asp 

120 

Ser Asn Val Lys 
135 

Arg Tyr Met His 
150 



Pro Lys Val Gin 
10 

Lys Phe Tyr Asp 
25 

His Leu Lys Phe 

Arg Asp Pro Phe 

60 

Thr Asp Pro Met 
75 

Phe Cys Glu Lys 
90 

He Ala Pro Glu 
105 

Lys Val Val Glu 

Leu Leu Trp Gly 

140 

Gly Ala Ala Thr 
155 

189 



Phe Glu Gly Lys 
15 

Pro Glu Glu He 
30 

Ser Val Ala Phe 
45 

Gly Asp Pro Thr 

Asp Lys Ala Phe 

80 

Leu Asn He Glu 
95 

Gly Lys Thr Leu 
110 

Arg He Lys Glu 
125 

Thr Ala Asn Leu 

Thr Cys Ser Ala 

160 
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Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 


Glu 


He 










165 










170 










175 




Thr 


Lys 


Glu 


Leu 


Gly 


Gly 


Glu 


Gly 


Tyr 


Val 


Phe 


Trp 


Gly 


Gly 


Arg 


Glu 








180 










185 










190 






Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Phe 


Glu 


Leu 


Glu 


Asn 






195 










200 










205 








Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 


Val 


Asp 


Tyr 


Ala 


Lys 


Arg 


He 


Gly 




210 










215 










220 










Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 


Thr 


Lys 


225 










230 










235 










240 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 


Lys 


Ser 










245 










250 










255 




His 


Gly 


Leu 


Asp 


Glu 


Tyr 


Phe 


Lys 


Phe 


Asn 


He 


Glu 


Ala 


Asn 


His 


Ala 








260 










265 










270 






Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 


His 


Glu 


Leu 


Arg 


Met 


Ala 


Arg 


He 






275 










280 










285 








Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


Leu 


Leu 


Leu 




290 










295 










300 










Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


Thr 


Asn 


Val 


Tyr 


Asp 


Thr 


Thr 


Leu 


305 










310 










315 










320 


Ala 


Met 


Tyr 


Glu 


Val 


He 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 


Gly 


Leu 










325 










330 










335 




Asn 


Phe 


Asp 


Ala 


Lys 


val 


Arg 


Arg 


Ala 


Ser 


Tyr 


Lys 


val 


Glu 


Asp 


Leu 








340 










345 










350 






Phe 


lie 


Gly 


His 


He 


Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 






355 










360 










365 








Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Leu 


Asp 


Lys 


Phe 


He 


Glu 




370 










375 










380 










Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Arg 


Glu 


Gly 


He 


Gly 


Arg 


Asp 


He 


Val 


Glu 


385 










390 










395 










400 


Gly 


Lys 


Val 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 


Lys 


Glu 










405 










410 










415 




Thr 


He 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


He 








420 










425 










430 






Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 


He 


Leu 


Glu 


Leu 


Arg 











435 440 



<210> 21 
<211> 1335 
<212> DNA 

<213> Thermotoga neapolitana 
<400> 21 

atggccgagt tcttcccgga gatcccgaag gtgcagttcg agggcaagga gtccaccaac 60 
ccgctcgcct tcaagttcta cgacccggag gagatcatcg acggcaagcc gctcaaggac 120 
cacctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 180 
ggcgacccga ccgccgaccg cccgtggaac cgctacaccg acccgatgga caaggccttc 240 
gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 
cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 
aaggtggtgg agcgcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 42 0 
accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 
gacgtgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 54 0 
ggcggcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 
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gacctcggct tcgagctgga gaacctcgcc cgcttcctcc gcatggccgt ggactacgcc 660 
aagcgcatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 720 
caccagtacg acttcgacgt ggccaccgcc tacgccttcc tcaagtccca cggcctcgac 780 
gagtacttca agttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 
cacgagctgc gcatggcccg catcctcggc aagctcggct ccatcgacgc caaccagggc 900 
gacctcctcc tcggctggga caccgaccag ttcccgacca acgtgtacga caccaccctc 960 
gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 
aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 
gacaccttcg ccctcggctt caaggtggcc tacaagctcg tgaaggacgg cgtgctcgac 1140 
aagttcatcg aggagaagta ccgctccttc cgcgagggca tcggccgcga catcgtggag 1200 
ggcaaggtgg acttcgagaa gctggaggag tacatcatcg acaaggagac catcgagctg 1260 
ccgtccggca agcaggagta cctggagtcc ctcatcaact cctacatcgt gaagaccatc 1320 
ctggagctgc gctga 1335 

<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 22 

agcgaattca tggcggctct ggccacgt 

<210> 23 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 23 

agctaagctt cagggcgcgg ccacgttct 

<210> 24 
<211> 825 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 24 




























Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


Thr 


Ser 


Ala 
20 


Gly 


His 


Trp 


Tyr 


Lys 
25 


His 


Gin 


Arg 


Ala 


Tyr 
30 


Gin 


Phe 


Thr 


Gly 


Glu 

35 


Asp 


Asp 


Phe 


Gly 


Lys 
40 


Val 


Ala 


Val 


Val 


Lys 
45 


Leu 


Pro 


Met 


Asp 


Leu 
50 


Thr 


Lys 


Val 


Gly 


He 
55 


He 


Val 


Arg 


Leu 


Asn 
60 


Glu 


Trp 


Gin 


Ala 


Lys 


Asp 


val 


Ala 


Lys 


Asp 


Arg 


Phe 


He 


Glu 


He 


Lys 


Asp 


Gly 


Lys 


Ala 
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65 










70 










75 






- 




80 


Glu 


Val 


Trp 


He 


Leu 


Gin 


Gly 


Val 


Glu 


Glu 


He 


Phe 


Tyr 


Glu 


Lys 


Pro 








85 










90 










95 




Asp 


Thr 


Ser 


Pro 


Arg 


He 


Phe 


Phe 


Ala 


Gin 


Ala 


Arg 


Ser 


Asn 


Lys 


Val 






100 










105 










110 






lie 


Glu 


Ala 


Phe 


Leu 


Thr 


Asn 


Pro 


Val 


Asp 

A. 


Thr 


Lys 


Lys 


Lys 


Glu 


Leu 






115 










120 










125 








Phe 

A. A. A 


Lys 


Val 


Thr 


Val 


Asp 


Gly 


Lys 


Glu 


He 


Pro 


Val 


Ser 


Arg 


Val 


Glu 




130 










135 










140 










Lvs 


Ala 


Asp 


Pro 


Thr 


Asp 


He 


Asp 


Val 


Thr 


Asn 


Tyr 


Val 


Arg 


He 


Val 


145 










150 










155 










160 


Leu 

J—J N*l 


Ser 


Glu 


Ser 


Leu 


Lys 


Glu 


Glu 


Asp 


Leu 


Arg 


Lys 


Asp 


Val 


Glu 


Leu 










165 










170 










175 




lie 

-i- -i- 


He 


Glu 


Gly 


Tyr 


Lys 


Pro 


Ala 


Arg 


Val 


He 


Met 


Met 


Glu 


He 


Leu 








180 










185 










190 








Asn 


Tvr 


Tvr 


Tvr 


Asp 


Gly 


Glu 


Leu 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Glu 






195 










200 










205 










Thr 

A- A A A* 


He 


Phe 


Arq 


Val 


Trp 


Ser 


Pro 


Val 


Ser 


Lys 


Trp 


Val 


Lys 


Val 


210 










215 










220 










J—f u 


TjPU 

J—t \J* 


Phe 

A, A A 


Lvs 


Asn 


Gly 


Glu 


Asp 


Thr 


Glu 


Pro 


Tyr 


Gin 


Val 


Val 


Asn 


225 










230 










235 










240 


Met 


Glu 


Tyr 


Lys 


Gly 


Asn 


Gly 


Val 


Trp 

AT 


Glu 


Ala 


Val 


Val 


Glu 


Gly 


Asp 










245 










250 










255 




Leu 


Asd 


Gly 


Val 


Phe 


Tyr 


Leu 


Tyr 


Gin 


Leu 


Glu 


Asn 


Tyr 


Gly 


Lys 


He 








260 










265 










270 








Thr 


Thr 

^ A A *k 


Val 


Asp 


Pro 


Tyr 


Ser 


Lys 


Ala 


Val 


Tyr 


Ala 


Asn 


Asn 


Gin 




275 










280 










285 








Glu 


Ser 

k—V N_*> JLm 


Ala 


Val 


Val 


Asn 


Leu 


Ala 


Arg 


Thr 


Asn 


Pro 


Glu 


Gly 


Trp 


Glu 




290 










295 










300 










Asn 


Asd 


Arg 


Gly 


Pro 


Lys 


He 


Glu 


Gly 


Tyr 


Glu 


Asp 

Aa 


Ala 


He 


He 


Tyr 


3 05 








310 










315 










320 


Glu 


He 


His 


He 


Ala 


Asp 


He 


Thr 


Gly 


Leu 


Glu 


Asn 


Ser 


Gly 


Val 


Lys 










325 










330 










335 




Asn 


Lys 


Gly 


Leu 


Tyr 


Leu 


Gly 


Leu 


Thr 


Glu 


Glu 


Asn 


Thr 


Lys 


Ala 


Pro 








340 










345 










350 






Glv 


Glv 


Val 


Thr 


Thr 


Gly 


Leu 


Ser 


His 


Leu 


Val 


Glu 


Leu 


Gly 


Val 


Thr 




355 










360 










365 








His 


Val 


His 


He 


Leu 


Pro 


Phe 


Phe 


Asp 


Phe 


Tyr 


Thr 


Gly 


Asp 


Glu 


Leu 




370 










375 










380 










Asp 


Lys 

"2 


Asp 


Phe 


Glu 


Lys 


Tyr 


Tyr 


Asn 


Trp 


Gly 


Tyr 


Asp 


Pro 


Tyr 


Leu 


385 








390 










395 










400 


Phe 


Met 


Val 


Pro 


Glu 


Gly 


Arg 


Tyr 


Ser 


Thr 


Asp 


Pro 


Lys 


Asn 


Pro 


His 










405 










410 










415 




Thr 


Arg 


He 


Arg 


Glu 


Val 


Lys 


Glu 


Met 


Val 


Lys 


Ala 


Leu 


His 


Lys 


His 






420 










425 










430 






Gly 


He 


Gly 

J 


Val 


He 


Met 


Asp 


Met 


Val 


Phe 


Pro 


His 


Thr 


Tyr 


Gly 


He 






435 










440 










445 








Gly 


Glu 


Leu 


Ser 


Ala 


Phe 


Asp 


Gin 


Thr 


Val 


Pro 


Tyr 


Tyr 


Phe 


Tyr 


Arg 


450 










455 










460 










He 


Asp 


Lys 


Thr 


Gly 


Ala 


Tyr 


Leu 


Asn 


Glu 


Ser 


Gly 


Cys 


Gly 


Asn 


Val 


465 










470 










475 










480 


He 


Ala 


Ser 


Glu 


Arg 


Pro 


Met 


Met 


Arg 


Lys 


Phe 


He 


Val 


Asp 


Thr 


Val 










485 










490 










495 




Thr 


Tyr 


Trp 


Val 


Lys 


Glu 


Tyr 


His 


He 


Asp 


Gly 


Phe 


Arg 


Phe 


Asp 


Gin 
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500 

Met Gly Leu lie 
515 

His Lys lie Asp 
530 

Trp Gly Ala Pro 
545 

Val Ala Ala Phe 

Phe Asn Pro Ser 

580 

Thr Lys lie Lys 
595 

Leu lie Lys Ser 
610 

Ala Cys His Asp 
625 

Lys Ala Asp Lys 

Gin Lys Leu Ala 

660 

Leu His Gly Gly 
675 

Ser Tyr Asn Ala 
690 

Leu Gin Phe lie 
705 

Arg Lys Glu His 

Lys His Leu Glu 

740 

Leu Lys Asp His 
755 

Tyr Asn Gly Asn 
770 

Trp Asn Val Val 
785 

Thr Val Glu Gly 

Tyr Arg Glu Ser 

820 



Asp Lys Lys Thr 

520 

Pro Thr lie lie 
535 

lie Arg Phe Gly 
550 

Asn Asp Glu Phe 
565 

Val Lys Gly Phe 

Arg Gly Val Val 

600 

Phe Ala Leu Asp 
615 

Asn His Thr Leu 
630 

Lys Lys Glu Trp 
645 

Gly Ala lie Leu 

Gin Asp Phe Cys 

680 

Pro lie Ser lie 
695 

Asp Val Phe Asn 
710 

Pro Ala Phe Arg 
725 

Phe Leu Pro Gly 

Ala Gly Gly Asp 

760 

Leu Glu Lys Thr 
775 

Val Asn Ser Gin 
790 

Thr lie Glu Leu 
805 

Glu Lys Asp Glu 



505 

Met Leu Glu Val 

Leu Tyr Gly Glu 

540 

Lys Ser Asp Val 
555 

Arg Asp Ala lie 
570 

Val Met Gly Gly 
585 

Gly Ser lie Asn 

Pro Glu Glu Thr 

620 

Trp Asp Lys Asn 
635 

Thr Glu Glu Glu 
650 

Leu Thr Ser Gin 
665 

Arg Thr Thr Asn 

Asn Gly Phe Asp 

700 

Tyr His Lys Gly 
715 

Leu Lys Asn Ala 
730 

Gly Arg Arg lie 
745 

Pro Trp Lys Asp 

Thr Tyr Lys Leu 

780 

Lys Ala Gly Thr 
795 

Asp Pro Leu Ser 
810 

Leu 
825 





510 






Glu 


Arg 


Ala 


Leu 


525 








Pro 


Trp 


Gly 


Gly 


Ala 


Gly 


Thr 


His 








560 


Arg 


Gly 


Ser 


Val 






575 




Tyr 


Gly 


Lys 


Glu 




590 






Tyr 


Asp 


Gly 


Lys 


605 








lie 


Asn 


Tyr 


Ala 


Tyr 


Leu 


Ala 


Ala 








640 


Leu 


Lys 


Asn 


Ala 






655 




Gly 


Val 


Pro 


Phe 




670 






Phe 


Asn 


Asp 


Asn 


685 








Tyr 


Glu 


Arg 


Lys 


Leu 


He 


Lys 


Leu 








720 


Glu 


Glu 


-r- -| 

He 


Lys 






735 




Val 


Ala 


Phe 


Met 




750 






Tie 

IXC 


Val 


Val 


He 


765 








Pro 


Glu 


Gly 


Lys 


Glu 


Val 


He 


Glu 








800 


Ala 


Tyr 


Val 


Leu 






815 





<210> 25 
<211> 2478 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 25 

atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 60 
ggccactggt acaagcacca gcgcgcctac cagttcaccg gcgaggacga cttcgggaag 120 
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gtggccgtgg tgaagctccc gatggacctc accaaggtgg gcatcatcgt gcgcctcaac 180 
gagtggcagg cgaaggacgt ggccaaggac cgcttcatcg agatcaagga cggcaaggcc 24 0 
gaggtgtgga tactccaggg cgtggaggag atcttctacg agaagccgga cacctccccg 3 00 
cgcatcttct tcgcccaggc ccgctccaac aaggtgatcg aggccttcct caccaacccg 360 
gtggacacca agaagaagga gctgttcaag gtgaccgtcg acggcaagga gatcccggtg 420 
tcccgcgtgg agaaggccga cccgaccgac atcgacgtga ccaactacgt gcgcatcgtg 4 80 
ctctccgagt ccctcaagga ggaggacctc cgcaaggacg tggagctgat catcgagggc 540 
tacaagccgg cccgcgtgat catgatggag atcctcgacg actactacta cgacggcgag 600 
ctgggggcgg tgtactcccc ggagaagacc atcttccgcg tgtggtcccc ggtgtccaag 660 
tgggtgaagg tgctcctctt caagaacggc gaggacaccg agccgtacca ggtggtgaac 720 
atggagtaca agggcaacgg cgtgtgggag gccgtggtgg agggcgacct cgacggcgtg 780 
ttctacctct accagctgga gaactacggc aagatccgca ccaccgtgga cccgtactcc 840 
aaggccgtgt acgccaacaa ccaggagtct gcagtggtga acctcgcccg caccaacccg 900 
gagggctggg agaacgaccg cggcccgaag atcgagggct acgaggacgc catcatctac 960 
gagatccaca tcgccgacat caccggcctg gagaactccg gcgtgaagaa caagggcctc 1020 
tacctcggcc tcaccgagga gaacaccaag gccccgggcg gcgtgaccac cggcctctcc 1080 
cacctcgtgg agctgggcgt gacccacgtg cacatcctcc cgttcttcga cttctacacc 1140 
ggcgacgagc tggacaagga cttcgagaag tactacaact ggggctacga cccgtacctc 1200 
ttcatggtgc cggagggccg ctactccacc gacccgaaga acccgcacac ccgaattcgc 1260 
gaggtgaagg agatggtgaa ggccctccac aagcacggca tcggcgtgat catggacatg 1320 
gtgttcccgc acacctacgg catcggcgag ctgtccgcct tcgaccagac cgtgccgtac 1380 
tacttctacc gcatcgacaa gaccggcgcc tacctcaacg agtccggctg cggcaacgtg 1440 
atcgcctccg agcgcccgat gatgcgcaag ttcatcgtgg acaccgtgac ctactgggtg 1500 
aaggagtacc acatcgacgg cttccgcttc gaccagatgg gcctcatcga caagaagacc 1560 
atgctggagg tggagcgcgc cctccacaag atcgacccga ccatcatcct ctacggcgag 1620 
ccgtggggcg gctggggggc cccgatccgc ttcggcaagt ccgacgtggc cggcacccac 1680 
gtggccgcct tcaacgacga gttccgcgac gccatccgcg gctccgtgtt caacccgtcc 1740 
gtgaagggct tcgtgatggg cggctacggc aaggagacca agatcaagcg cggcgtggtg 1800 
ggctccatca actacgacgg caagctcatc aagtccttcg ccctcgaccc ggaggagacc 1860 
atcaactacg ccgcctgcca cgacaaccac accctctggg acaagaacta cctcgccgcc 1920 
aaggccgaca agaagaagga gtggaccgag gaggagctga agaacgccca gaagctcgcc 1980 
ggcgccatcc tcctcactag tcagggcgtg ccgttcctcc acggcggcca ggacttctgc 2040 
cgcaccacca acttcaacga caactcctac aacgccccga tctccatcaa cggcttcgac 2100 
tacgagcgca agctccagtt catcgacgtg ttcaactacc acaagggcct catcaagctc 2160 
cgcaaggagc acccggcctt ccgcctcaag aacgccgagg agatcaagaa gcacctggag 2220 
ttcctcccgg gcgggcgccg catcgtggcc ttcatgctca aggaccacgc cggcggcgac 2280 
ccgtggaagg acatcgtggt gatctacaac ggcaacctgg agaagaccac ctacaagctc 2340 
ccggagggca agtggaacgt ggtggtgaac tcccagaagg ccggcaccga ggtgatcgag 2400 
accgtggagg gcaccatcga gctggacccg ctctccgcct acgtgctcta ccgcgagtcc 2460 
gagaaggacg agctgtga 2478 

<210> 26 
<211> 718 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 26 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Met Glu Thr lie Lys lie Tyr Glu Asn Lys Gly Val Tyr 

20 25 30 
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Lys 


Val 


Val 


He 


Gly 


Glu 


Pro 


Phe 


Pro 


Pro 


He 


Glu 


Phe 


Pro 


Leu 


Glu 




35 










40 










45 








Gin 


Lys 
50 


lie 


Ser 


Ser 


Asn 


Lys 
55 


Ser 


Leu 


Ser 


Glu 


Leu 
60 


Gly 


Leu 


Thr 


He 


Val 


Gin 


Gin 


Gly 


Asn 


Lys 


Val 


He 


Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 


65 










70 










75 










80 


Glu 


His 


He 


He 


Gly 
85 


Leu 


Gly 


Glu 


Lys 


Ala 
90 


Phe 


Glu 


Leu 


Asp 


Arg 
95 


Lys 


Arg 


Lys 


Arg 


Tyr 


val 


Met 


Tyr 


Asn 


Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 






100 










105 










110 






Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


Leu 


Phe 


He 


Ser 


Val 


Lys 




115 










120 










125 








Asp 

A- 


Gly 


Val 


Ala 


Thr 


Gly 


Tyr 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


He 


130 










135 










140 










Phe 


Asp 


Val 


Gly 


Leu 


Glu 


Glu 


Tyr 


Asp 


Lys 


Val 


He 


Val 


Thr 


lie 


Pro 


145 








150 










155 










160 


Glu 


Asp 


Ser 


Val 


Glu 


Phe 


Tyr 


Val 


He 


Glu 


Gly 


Pro 


Arg 


He 


Glu 


Asp 








165 










170 










175 




Val 


Leu 


Glu 


Lys 
180 


Tyr 


Thr 


Glu 


Leu 


Thr 
185 


Gly 


Lys 


Pro 


Phe 


Leu 
190 


Pro 


Pro 


Met 


Trp 


Ala 
195 


Phe 


Gly 


Tyr 


Met 


He 
200 


Ser 


Arg 


Tyr 


Ser 


Tyr 
205 


Tyr 


Pro 


Gin 


Asp 


Lys 


Val 


Val 


Glu 


Leu 


Val 


Asp 


He 


Met 


Gin 


Lys 


Glu 


Gly 


Phe 


Arg 


210 










215 










220 










Val 


Ala 


Gly 


Val 


Phe 


Leu 


Asp 


He 


His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 


225 








230 










235 










240 


Phe 


Thr 


Trp 


His 


Pro 


Tyr 


Arg 


Phe 


Pro 


Glu 


Pro 


Lys 


Lys 


Leu 


He 


Asp 








245 










250 










255 




Glu 


Leu 


His 


Lys 
260 


Arg 


Asn 


Val 


Lys 


Leu 
265 


He 


Thr 


He 


Val 


Asp 

270 


His 


Gly 


lie 


Arg 


Val 
275 


Asp 


Gin 


Asn 


Tyr 


Ser 
280 


Pro 


Phe 


Leu 


Ser 


Gly 
285 


Met 


Gly 


Lys 


Phe 


Cys 
290 


Glu 


He 


Glu 


Ser 


Gly 
295 


Glu 


Leu 


Phe 


Val 


Gly 
300 


Lys 


Met 


Trp 


Pro 


Gly 


Thr 


Thr 


Val 


Tyr 


Pro 


Asp 


Phe 


Phe 


Arg 


Glu 


Asp 


Thr 


Arg 


Glu 


Trp 


305 










310 










315 










320 


Trp 


Ala 


Gly 


Leu 


He 


Ser 


Glu 


Trp 


Leu 


Ser 


Gin 


Gly 


Val 


Asp 


Gly 


He 








325 










330 










335 




Trp 


Leu 


Asp 


Met 


Asn 


Glu 


Pro 


Thr 


Asp 


Phe 


Ser 


Arg 


Ala 


He 


Glu 


He 






340 










345 










350 






Arg 


Asp 


Val 


Leu 


Ser 


Ser 


Leu 


Pro 


Val 


Gin 


Phe 


Arg 


Asp 


Asp 


Arg 


Leu 


355 










360 










365 








Val 


Thr 

370 


Thr 


Phe 


Pro 


Asp 


Asn 
375 


Val 


Val 


His 


Tyr 


Leu 
380 


Arg 


Gly 


Lys 


Arg 


val 


Lys 


His 


Glu 


Lys 


Val 


Arg 


Asn 


Ala 


Tyr 


Pro 


Leu 


Tyr 


Glu 


Ala 


Met 


385 








390 










395 










400 


Ala 


Thr 


Phe 


Lys 


Gly 
405 


Phe 


Arg 


Thr 


Ser 


His 
410 


Arg 


Asn 


Glu 


He 


Phe 
415 


He 


Leu 


Ser 


Arg 


Ala 
420 


Gly 


Tyr 


Ala 


Gly 


He 
425 


Gin 


Arg 


Tyr 


Ala 


Phe 
430 


He 


Trp 


Thr 


Gly 


Asp 
435 


Asn 


Thr 


Pro 


Ser 


Trp 
440 


Asp 


Asp 


Leu 


Lys 


Leu 
445 


Gin 


Leu 


Gin 


Leu 


Val 
450 


Leu 


Gly 


Leu 


Ser 


He 
455 


Ser 


Gly 


Val 


Pro 


Phe 
460 


Val 


Gly 


Cys 


Asp 
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lie 


Gly 


Gly 


Phe 


Gin 


Gly 


Arg 


Asn 


465 










470 






Asp 


Leu 


Leu 


Val 


Lys 


Tyr 


Tyr 


Ala 










485 








Arg 


Ser 


His 


Lys 


Ala 


Thr 


Asp 


Gly 








500 










Pro 


Asp 


Tyr 


Tyr 


Lys 


Glu 


Lys 


Val 






515 










520 


Lys 

-* 


Phe 


Leu 


Pro 


Tyr 


He 


Tyr 


Ser 




530 










535 




Gly 


His 


Pro 


Val 


He 


Arg 


Pro 


Leu 


545 










550 






Asp 


Met 


Tyr 


Arg 


He 


Glu 


Asp 


Glu 










565 








Tyr 


Ala 


Pro 


lie 


Val 


Ser 


Lys 


Glu 








580 










Arq 


Gly 


Lys 


Trp 


Tyr 


Asn 


Tyr 


Trp 






595 










600 


Ser 


Val 


Val 


Lys 


Ser 


Thr 


His 


Glu 




610 










615 




Ser 


lie 


lie 


Pro 


Leu 


Glu 


Gly 


Asp 


625 










630 






Ser 


Phe 


Lys 


Arg 


Tyr 


Asp 


Asn 


Ala 










645 








lie 


Lvs 


Phe 


Ser 


Arq 


Glu 


He 


Tyr 








660 










Glu 


Lys 


Pro 


Val 


Ser 


Lys 


He 


lie 






675 










680 


Val 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 




690 










695 




lie 


Arg 


Gly 


Lys 


He 


Asn 


Leu 


Glu 


705 










710 







Phe 


Ala 


Glu 


He 


Asp 


Asn 


Ser 


Met 






475 










480 


Leu 


Ala 


Leu 


Phe 


Phe 


Pro 


Phe 


Tyr 




490 










495 




He 


Asp 


Thr 


Glu 


Pro 


Val 


Phe 


Leu 


505 










510 






Lys 


Glu 


He 


Val 


Glu 


Leu 


Arg 


Tyr 










525 








Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lys 








540 










Phe 


Tyr 


Glu 


Phe 


Gin 


Asp 


Asp 


Asp 






555 










560 


Tyr 


Met 


val 


Gly 


Lys 


Tyr 


Leu 


Leu 




570 










575 




Glu 


Ser 


Arg 


Leu 


Val 


Thr 


Leu 


Pro 


585 










590 






Asn 


Gly 


Glu 


He 


He 


Asn 


Gly 


Lys 










605 








Leu 


Pro 


He 


Tyr 


Leu 


Arg 


Glu 


Gly 








620 










Glu 


Leu 


He 


Val 


Tyr 


Gly 


Glu 


Thr 






635 










640 


Glu 


He 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 




650 










655 




Val 


Ser 


Lys 


Leu 


Thr 


He 


Thr 


Ser 


665 










670 






Val 


Asp 


Asp 


Ser 


Lys 


Glu 


He 


Gin 










685 








Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 








700 










Ser 


Glu 


Lys 


Asp 


Glu 


Leu 







<210> 27 
<211> 712 
<212> PRT 
<213> Artificia 

<220> 

<223> synthetic 

<400> 27 
Met Arg Val Leu 
1 

Ala Thr Ser Met 

20 

Lys Val Val He 
35 

Gin Lys He Ser 
50 

Val Gin Gin Gly 
65 

Glu His He He 



Sequence 



Leu Val Ala Leu 

5 

Glu Thr He Lys 

Gly Glu Pro Phe 

40 

Ser Asn Lys Ser 
55 

Asn Lys Val He 
70 

Gly Leu Gly Glu 



Ala Leu Leu Ala 
10 

He Tyr Glu Asn 
25 

Pro Pro He Glu 

Leu Ser Glu Leu 

60 

Val Glu Lys Ser 
75 

Lys Ala Phe Glu 
196 



Leu Ala Ala Ser 
15 

Lys Gly Val Tyr 
30 

Phe Pro Leu Glu 
45 

Gly Leu Thr He 

Leu Asp Leu Lys 

80 

Leu Asp Arg Lys 
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85 










90 










95 




Arg 


Lys 


Arg 


Tyr 


Val 


Met 


Tyr 


Asn 


Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 








100 










105 










110 






Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


Leu 


Phe 


He 


Ser 


Val 


Lys 




115 










120 










125 








Asp 


Gly 


Val 


Ala 


Thr 


Gly 


Tyr 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


He 




130 










135 










140 










Phe 


Asp 


Val 


Gly 


Leu 


Glu 


Glu 


Tyr 


Asp 


Lys 


Val 


He 


Val 


Thr 


He 


Pro 


145 










150 










155 










160 


Glu 


Asp 


Ser 


val 


Glu 


Phe 


Tyr 


Val 


lie 


Glu 


Gly 


Pro 


Arg 


He 


Glu 


Asp 










165 










170 










175 




Val 


Leu 


Glu 


Lys 


Tyr 


Thr 


Glu 


Leu 


Thr 


Gly 


Lys 


Pro 


Phe 


Leu 


Pro 


Pro 








180 










185 










190 




• 


Met 


Trp 


Ala 


Phe 


Gly 


Tyr 


Met 


He 


Ser 


Arg 


Tyr 


Ser 


Tyr 


Tyr 


Pro 


Gin 






195 










200 










205 








Asp 


Lys 


Val 


Val 


Glu 


Leu 


Val 


Asp 


He 


Met 


Gin 


Lys 


Glu 


Gly 


Phe 


Arg 




210 










215 










220 










Val 


Ala 


Gly 


Val 


Phe 


Leu 


Asp 


He 


His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 


225 










230 










235 










240 


Phe 


Thr 


Trp 


His 


Pro 


Tyr 


Arg 


Phe 


Pro 


Glu 


Pro 


Lys 


Lys 


Leu 


He 


Asp 










245 










250 










255 




Glu 


Leu 


His 


Lys 


Arg 


Asn 


Val 


Lys 


Leu 


He 


Thr 


He 


Val 


Asp 


His 


Gly 








260 










265 










270 






lie 


Arg 


Val 


Asp 


Gin 


Asn 


Tyr 


Ser 


Pro 


Phe 


Leu 


Ser 


Gly 


Met 


Gly 


Lys 






275 










280 










285 








Phe 


Cys 


Glu 


lie 


Glu 


Ser 


Gly 


Glu 


Leu 


Phe 


Val 


Gly 


Lys 


Met 


Trp 


Pro 




290 










295 










300 










Gly 


Thr 


Thr 


Val 


Tyr 


Pro 


Asp 


Phe 


Phe 


Arg 


Glu 


Asp 


Thr 


Arg 


Glu 


Trp 


305 










310 










315 










320 


Trp 


Ala 


Gly 


Leu 


He 


Ser 


Glu 


Trp 


Leu 


Ser 


Gin 


Gly 


Val 


Asp 


Gly 


He 








325 










330 










335 




Trp 


Leu 


Asp 


Met 


Asn 


Glu 


Pro 


Thr 


Asp 


Phe 


Ser 


Arg 


Ala 


He 


Glu 


He 






340 






■• 




345 










350 






Arg 


Asp 


Val 


Leu 


Ser 


Ser 


Leu 


Pro 


Val 


Gin 


Phe 


Arg 


Asp 


Asp 


Arg 


Leu 






355 










360 










365 








Val 


Thr 


Thr 


Phe 


Pro 


Asp 


Asn 


Val 


Val 


His 


Tyr 


Leu 


Arg 


Gly 


Lys 


Arg 




370 










375 










380 










Val 


Lys 


His 


Glu 


Lys 


Val 


Arg 


Asn 


Ala 


Tyr 


Pro 


Leu 


Tyr 


Glu 


Ala 


Met 


385 








390 










395 










400 


Ala 


Thr 


Phe 


Lys 


Gly 


Phe 


Arg 


Thr 


Ser 


His 


Arg 


Asn 


Glu 


He 


Phe 


He 










405 










410 










415 




Leu 


Ser 


Arg 


Ala 


Gly 


Tyr 


Ala 


Gly 


He 


Gin 


Arg 


Tyr 


Ala 


Phe 


He 


Trp 








420 










425 










430 






Thr 


Gly 


Asp 


Asn 


Thr 


Pro 


Ser 


Trp 


Asp 


Asp 


Leu 


Lys 


Leu 


Gin 


Leu 


Gin 






435 










440 










445 








Leu 


Val 


Leu 


Gly 


Leu 


Ser 


He 


Ser 


Gly 


Val 


Pro 


Phe 


Val 


Gly 


Cys 


Asp 




450 










455 










460 










lie 


Gly 


Gly 


Phe 


Gin 


Gly 


Arg 


Asn 


Phe 


Ala 


Glu 


He 


Asp 


Asn 


Ser 


Met 


465 










470 










475 










480 


Asp 


Leu 


Leu 


Val 


Lys 


Tyr 


Tyr 


Ala 


Leu 


Ala 


Leu 


Phe 


Phe 


Pro 


Phe 


Tyr 










485 










490 










495 




Arg 


Ser 


His 


Lys 


Ala 


Thr 


Asp 


Gly 


He 


Asp 


Thr 


Glu 


Pro 


Val 


Phe 


Leu 








500 










505 










510 






Pro 


Asp 


Tyr 


Tyr 


Lys 


Glu 


Lys 


Val 


Lys 


Glu 


He 


Val 


Glu 


Leu 


Arg 


Tyr 
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515 520 525 



Lys 


Phe 


Leu 


Pro 


Tyr 


He 


Tyr 


Ser 


Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lys 




53 0 










535 










54 0 










Gly 


His 


Pro 


Val 


lie 


Arg 


Pro 


Leu 


Phe 


Tyr 


Glu 


Phe 


Gin 


Asp 


Asp 


Asp 


545 










550 










555 










560 


Asp 


Met 


Tyr 


Arg 


He 


Glu 


Asp 


Glu 


Tyr 


Met 


Val 


Gly 


Lys 


Tyr 


Leu 


Leu 










565 










570 










575 




Tyr 


Ala 


Pro 


He 


Val 


Ser 


Lys 


Glu 


Glu 


Ser 


Arg 


Leu 


Val 


Thr 


Leu 


Pro 








580 










585 










590 






Arg 


Gly 


Lys 


Trp 


Tyr 


Asn 


Tyr 


Trp 


Asn 


Gly 


Glu 


He 


He 


Asn 


Gly 


Lys 






595 










600 










605 








Ser 


Val 


Val 


Lys 


Ser 


Thr 


His 


Glu 


Leu 


Pro 


He 


Tyr 


Leu 


Arg 


Glu 


Gly 




610 










615 










620 










Ser 


He 


He 


Pro 


Leu 


Glu 


Gly 


Asp 


Glu 


Leu 


He 


Val 


Tyr 


Gly 


Glu 


Thr 


625 










630 










635 










640 


Ser 


Phe 


Lys 


Arg 


Tyr 


Asp 


Asn 


Ala 


Glu 


He 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 










645 










650 










655 




He 


Lys 


Phe 


Ser 


Arg 


Glu 


He 


Tyr 


Val 


Ser 


Lys 


Leu 


Thr 


He 


Thr 


Ser 








660 










665 










670 






Glu 


Lys 


Pro 


Val 


Ser 


Lys 


He 


He 


Val 


Asp 


Asp 


Ser 


Lys 


Glu 


lie 


Gin 






675 










680 










685 








Val 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 


Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 




690 










695 










700 










He 


Arg 


Gly 


Lys 


He 


Asn 


Leu 


Glu 


















705 










710 























<210> 28 
<211> 469 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 28 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 








5 










10 










15 




Ala 


Thr 


Ser 


Met 
20 


Ala 


Glu 


Phe 


Phe 


Pro 
25 


Glu 


lie 


Pro 


Lys 


He 
30 


Gin 


Phe 


Glu 


Gly 


Lys 
35 


Glu 


Ser 


Thr 


Asn 


Pro 
40 


Leu 


Ala 


Phe 


Arg 


Phe 
45 


Tyr 


Asp 


Pro 


Asn 


Glu 
50 


Val 


He 


Asp 


Gly 


Lys 
55 


Pro 


Leu 


Lys 


Asp 


His 
60 


Leu 


Lys 


Phe 


Ser 


Val 


Ala 


Phe 


Trp 


His 


Thr 


Phe 


Val 


Asn 


Glu 


Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


65 










70 










75 










80 


Asp 


Pro 


Thr 


Ala 


Glu 
85 


Arg 


Pro 


Trp 


Asn 


Arg 
90 


Phe 


Ser 


Asp 


Pro 


Met 
95 


Asp 


Lys 


Ala 


Phe 


Ala 
100 


Arg 


Val 


Asp 


Ala 


Leu 
105 


Phe 


Glu 


Phe 


Cys 


Glu 
110 


Lys 


Leu 


Asn 


lie 


Glu 
115 


Tyr 


Phe 


Cys 


Phe 


His 
120 


Asp 


Arg 


Asp 


He 


Ala 
125 


Pro 


Glu 


Gly 


Lys 


Thr 
130 


Leu 


Arg 


Glu 


Thr 


Asn 
135 


Lys 


He 


Leu 


Asp 


Lys 
140 


Val 


Val 


Glu 


Arg 
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lie 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


145 








150 










155 










160 


Ala 


Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 


His 


Gly 


Ala 


Ala 


Thr 


Thr 










165 










170 










175 




Cys 


Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 






180 










185 










190 






Leu 


Glu 


He 


Thr 


Lys 


Glu 


Leu 


Gly 


Gly 


Glu 


Gly 


Tyr 


Val 


Phe 


Trp 


Gly 






195 










200 










205 








Gly 


Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Leu 


Glu 


210 










215 










220 










Leu 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 


Val 


Glu 


Tyr 


Ala 


Lys 


225 










230 










235 










240 


Lys 


He 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 








245 










250 










255 




Pro 


Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 








260 










265 










270 






Leu 


Lys 


Asn 


His 


Gly 


Leu 


Asp 


Glu 


Tyr 


Phe 


Lys 


Phe 


Asn 


He 


Glu 


Ala 






275 










280 










285 








Asn 


His 


Ala 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 


His 


Glu 


Leu 


Arg 


Met 




290 










295 










300 










Ala 


Arg 


He 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


Asp 


Ala 


Asn 


Gin 


Gly Asp 


305 








310 










315 










320 


Leu 


Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


Thr 


Asn 


He 


Tyr 


Asp 










325 










330 










335 




Thr 


Thr 


Leu 


Ala 


Met 


Tyr 


Glu 


Val 


lie 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 








340 










345 










350 






Gly 


Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arg 


Arg 


Ala 


Ser 


Tyr 


Lys 


Val 




355 










360 










365 








Glu 


Asp 


Leu 


Phe 


He 


Gly 


His 


He 


Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 




370 










375 










380 










Gly 


Phe 


Lys 


He 


Ala 


Tyr 


Lys 


Leu 


Ala 


Lys 


Asp 


Gly 


Val 


Phe 


Asp 


Lys 


385 










390 










395 










400 


Phe 


He 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Lys 


Glu 


Gly 


He 


Gly 


Lys 


Glu 










405 










410 










415 




He 


Val 


Glu 


Gly 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 








420 










425 










430 






Asp 


Lys 


Glu 


Asp 


He 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


435 










440 










445 








Ser 


Leu 


Leu 


Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 


He 


Ala 


Glu 


Leu 


Arg 


Ser 




450 










455 










460 










Glu 


Lys 


Asp 


Glu 


Leu 

























465 



<210> 29 
<211> 469 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 29 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
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15 10 15 

Ala Thr Ser Met Ala Glu Phe Phe Pro Glu lie Pro Lys Val Gin Phe 

20 25 30 

Glu Gly Lys Glu Ser Thr Asn Pro Leu Ala Phe Lys Phe Tyr Asp Pro 

35 40 45 

Glu Glu lie lie Asp Gly Lys Pro Leu Lys Asp His Leu Lys Phe Ser 

50 55 60 

Val Ala Phe Trp His Thr Phe Val Asn Glu Gly Arg Asp Pro Phe Gly 
65 70 75 80 

Asp Pro Thr Ala Asp Arg Pro Trp Asn Arg Tyr Thr Asp Pro Met Asp 

85 90 95 

Lys Ala Phe Ala Arg Val Asp Ala Leu Phe Glu Phe Cys Glu Lys Leu 

100 105 110 

Asn lie Glu Tyr Phe Cys Phe His Asp Arg Asp lie Ala Pro Glu Gly 

115 120 125 

Lys Thr Leu Arg Glu Thr Asn Lys lie Leu Asp Lys Val Val Glu Arg 

130 135 140 

lie Lys Glu Arg Met Lys Asp Ser Asn Val Lys Leu Leu Trp Gly Thr 
145 150 155 160 

Ala Asn Leu Phe Ser His Pro Arg Tyr Met His Gly Ala Ala Thr Thr 

165 170 175 

Cys Ser Ala Asp Val Phe Ala Tyr Ala Ala Ala Gin Val Lys Lys Ala 

180 185 190 

Leu Glu lie Thr Lys Glu Leu Gly Gly Glu Gly Tyr Val Phe Trp Gly 

195 200 205 

Gly Arg Glu Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Gly Phe Glu 

210 215 220 

Leu Glu Asn Leu Ala Arg Phe Leu Arg Met Ala Val Asp Tyr Ala Lys 
225 230 235 240 

Arg lie Gly Phe Thr Gly Gin Phe Leu lie Glu Pro Lys Pro Lys Glu 

245 250 255 

Pro Thr Lys His Gin Tyr Asp Phe Asp Val Ala Thr Ala Tyr Ala Phe 

260 265 270 

Leu Lys Ser His Gly Leu Asp Glu Tyr Phe Lys Phe Asn lie Glu Ala 

275 280 285 

Asn His Ala Thr Leu Ala Gly His Thr Phe Gin His Glu Leu Arg Met 

290 295 300 

Ala Arg He Leu Gly Lys Leu Gly Ser He Asp Ala Asn Gin Gly Asp 
305 310 315 320 

Leu Leu Leu Gly Trp Asp Thr Asp Gin Phe Pro Thr Asn Val Tyr Asp 

325 330 335 

Thr Thr Leu Ala Met Tyr Glu Val He Lys Ala Gly Gly Phe Thr Lys 

340 345 350 

Gly Gly Leu Asn Phe Asp Ala Lys Val Arg Arg Ala Ser Tyr Lys Val 

355 360 365 

Glu Asp Leu Phe He Gly His lie Ala Gly Met Asp Thr Phe Ala Leu 

370 375 380 

Gly Phe Lys Val Ala Tyr Lys Leu Val Lys Asp Gly Val Leu Asp Lys 
385 390 395 400 

Phe He Glu Glu Lys Tyr Arg Ser Phe Arg Glu Gly He Gly Arg Asp 

405 410 415 

He Val Glu Gly Lys Val Asp Phe Glu Lys Leu Glu Glu Tyr He He 

420 425 430 

Asp Lys Glu Thr He Glu Leu Pro Ser Gly Lys Gin Glu Tyr Leu Glu 
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435 440 445 

Ser Leu lie Asn Ser Tyr lie Val Lys Thr lie Leu Glu Leu Arg Ser 

450 455 460 

Glu Lys Asp Glu Leu 
465 



<210> 30 

<211> 463 

<212> PRT 

<213> Artificia 

<220> 

<223> synthetic 

<400> 30 
Met Arg Val Leu 
1 

Ala Thr Ser Met 

20 

Glu Gly Lys Glu 
35 

Glu Glu lie lie 
50 

Val Ala Phe Trp 
65 

Asp Pro Thr Ala 

Lys Ala Phe Ala 

100 

Asn lie Glu Tyr 
115 

Lys Thr Leu Arg 
130 

lie Lys Glu Arg 
145 

Ala Asn Leu Phe 

Cys Ser Ala Asp 

180 

Leu Glu lie Thr 
195 

Gly Arg Glu Gly 
210 

Leu Glu Asn Leu 
225 

Arg lie Gly Phe 

Pro Thr Lys His 

260 

Leu Lys Ser His 
275 

Asn His Ala Thr 
290 



Sequence 



Leu 


Val 


Ala 


Leu 


5 








Ala 


Glu 


Phe 


Phe 


Ser 


Thr 


Asn 


Pro 








40 


Asd 


Gly 


Lys 


Pro 






55 




His 


Thr 


Phe 


Val 




70 






Asp 


Arg 


Pro 


Trp 


85 








Arg 


Val 


Asp 


Ala 


Phe 


Cvs 


Phe 


His 








120 


Glu 


Thr 


Asn 


Lys 






135 




Met 


Lys 


Asp 


Ser 




150 






Ser 


His 


Pro 


Arg 


165 








Val 


Phe 


Ala 


Tyr 


Lys 


Glu 


Leu 


Gly 








200 


Tyr 


Glu 


Thr 


Leu 






215 




Ala 


Arg 


Phe 


Leu 




230 






Thr 


Gly 


Gin 


Phe 


245 








Gin 


Tyr 


Asp 


Phe 


Gly 


Leu 


Asp 


Glu 








280 


Leu 


Ala 


Gly 


His 






295 





Ala Leu Leu Ala 
10 

Pro Glu lie Pro 
25 

Leu Ala Phe Lys 

Leu Lys Asp His 

60 

Asn Glu Gly Arg 
75 

Asn Arg Tyr Thr 
90 

Leu Phe Glu Phe 
105 

Asp Arg Asp lie 

lie Leu Asp Lys 

140 

Asn Val Lys Leu 
155 

Tyr Met His Gly 
170 

Ala Ala Ala Gin 
185 

Gly Glu Gly Tyr 

Leu Asn Thr Asp 

220 

Arg Met Ala Val 
235 

Leu lie Glu Pro 
250 

Asp Val Ala Thr 
265 

Tyr Phe Lys Phe 

Thr Phe Gin His 

300 

201 



Leu Ala Ala Ser 
15 

Lys Val Gin Phe 
30 

Phe Tyr Asp Pro 
45 

Leu Lys Phe Ser 

Asp Pro Phe Gly 

80 

Asp Pro Met Asp 
95 

Cys Glu Lys Leu 
110 

Ala Pro Glu Gly 
125 

Val Val Glu Arg 

Leu Trp Gly Thr 

160 

Ala Ala Thr Thr 
175 

Val Lys Lys Ala 
190 

Val Phe Trp Gly 
205 

Leu Gly Phe Glu 

Asp Tyr Ala Lys 

240 

Lys Pro Lys Glu 
255 

Ala Tyr Ala Phe 
270 

Asn lie Glu Ala 
285 

Glu Leu Arg Met 
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Ala 


Arg 


lie 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


305 










310 










315 










320 


Leu 


Leu 


Leu 


Gly 


Trp 
325 


Asp 


Thr 


Asp 


Gin 


Phe 
330 


Pro 


Thr 


Asn 


Val 


Tyr 
335 


Asp 


Thr 


Thr 


Leu 


Ala 
340 


Met 


Tyr 


Glu 


Val 


He 
345 


Lys 


Ala 


Gly 


Gly 


Phe 
350 


Thr 


Lys 


Gly 


Gly 


Leu 
355 


Asn 


Phe 


Asp 


Ala 


Lys 
360 


Val 


Arg 


Arg 


Ala 


Ser 
365 


Tyr 


Lys 


Val 


Glu 


Asp 

370 


Leu 


Phe 


He 


Gly 


His 
375 


He 


Ala 


Gly 


Met 


Asp 
380 


Thr 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 


Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Leu 


Asp 


Lys 


385 










390 










395 










400 


Phe 


He 


Glu 


Glu 


Lys 
405 


Tyr 


Arg 


Ser 


Phe 


Arg 
410 


Glu 


Gly 


lie 


Gly 


Arg 
415 


Asp 


He 


Val 


Glu 


Gly 
420 


Lys 


Val 


Asp 


Phe 


Glu 
425 


Lys 


Leu 


Glu 


Glu 


Tyr 
430 


He 


He 


Asp 


Lys 


Glu 
435 


Thr 


He 


Glu 


Leu 


Pro 
440 


Ser 


Gly 


Lys 


Gin 


Glu 
445 


Tyr 


Leu 


Glu 


Ser 


Leu 
450 


He 


Asn 


Ser 


Tyr 


He 
455 


Val 


Lys 


Thr 


He 


Leu 
460 


Glu 


Leu 


Arg 





<210> 31 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 31 

Met Gly Lys Asn Gly Asn Leu Cys Cys Phe Ser Leu Leu Leu Leu Leu 

15 10 15 

Leu Ala Gly Leu Ala Ser Gly His Gin 

20 25 



<210> 32 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 32 

Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu Leu Val Ser 

15 10 15 

Thr Leu Leu Leu Phe Leu Val He Ser His Ser Cys Arg Ala 

20 25 30 



<210> 33 
<211> 460 



202 
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<212> PRT 
<213> Artificia 

<220> 

<223> synthetic 

<400> 33 
Met Arg Val Leu 
1 

Ala Thr Ser Ala 

20 

Gin Ala Phe Tyr 
35 

lie Arg Gin Lys 
50 

Trp lie Pro Pro 
65 

Tyr Asp Pro Tyr 

Thr Val Glu Thr 

100 

Asn Thr Ala His 
115 

Asn His Arg Ala 
130 

Tyr Thr Trp Thr 
145 

Asn Tyr Leu Asp 

Thr Phe Gly Gly 

180 

Tyr Trp Leu Trp 
195 

lie Gly lie Asp 
210 

Trp Val Val Lys 
225 

Glu Tyr Trp Asp 

Ser Gly Ala Lys 

260 

Ala Phe Asp Asn 
275 

Gly Gly Thr Val 
290 

Ala Asn His Asp 
305 

Phe lie Leu Thr 

Glu Glu Trp Leu 

340 

Asp Asn Leu Ala 
355 

Glu Met lie Phe 



Sequence 



Leu 


Val 


Ala 


Leu 


5 








Lys 


Tyr 


Leu 


Glu 


Trp 


Asp 


Val 


Pro 








40 


He 


Pro 


Glu 


Trp 






55 




Ala 


Ser 


Lys 


Gly 




7 0 






Asp 


Tyr 


Phe 


Asp 


85 








Arg 


Phe 


Gly 


Ser 


Ala 


Tyr 


Gly 


He 








120 


Gly 


Gly 


Asp 


Leu 






135 




Asp 


Phe 


Ser 


Lys 




150 






Phe 


His 


Pro 


Asn 


165 








Tyr 


Pro 


Asp 


He 


Ala 


Ser 


Gin 


Glu 








200 


Ala 


Trp 


Arg 


Phe 






215 




Asp 


Trp 


Leu 


Asn 




Tin 
Z j u 






Thr 


Asn 


Val 


Asp 


245 








Val 


Phe 


Asp 


Phe 


Lys 


Asn 


He 


Pro 








280 


Val 


Ser 


Arg 


Asp 






295 




Thr 


Asp 


He 


He 




310 






Tyr 


Glu 


Gly 


Gin 


325 








Asn 


Lys 


Asp 


Lys 


Gly 


Gly 


Ser 


Thr 








360 


Val 


Arg 


Asn 


Gly 



Ala Leu Leu Ala 
10 

Leu Glu Glu Gly 
25 

Ser Gly Gly He 

Tyr Asp Ala Gly 

60 

Met Ser Gly Gly 
75 

Leu Gly Glu Tyr 
90 

Lys Gin Glu Leu 
105 

Lys Val He Ala 

Glu Trp Asn Pro 

140 

Val Ala Ser Gly 
155 

Glu Leu His Ala 
170 

Cys His Asp Lys 
185 

Ser Tyr Ala Ala 

Asp Tyr Val Lys 

220 

Trp Trp Gly Gly 
235 

Ala Leu Leu Asn 
250 

Pro Leu Tyr Tyr 
265 

Ala Leu Val Glu 

Pro Phe Lys Ala 

300 

Trp Asn Lys Tyr 
315 

Pro Thr He Phe 
330 

Leu Lys Asn Leu 
345 

Ser He Val Tyr 

Tyr Gly Ser Lys 
203 



Leu Ala Ala Ser 
15 

Gly Val He Met 
30 

Trp Trp Asp Thr 
45 

He Ser Ala He 

Tyr Ser Met Gly 

80 

Tyr Gin Lys Gly 
95 

He Asn Met He 
110 

Asp He Val He 
125 

Phe Val Gly Asp 

Lys Tyr Thr Ala 

160 

Gly Asp Ser Gly 
175 

Ser Trp Asp Gin 
190 

Tyr Leu Arg Ser 
205 

Gly Tyr Gly Ala 

Trp Ala Val Gly 

240 

Trp Ala Tyr Ser 
255 

Lys Met Asp Ala 
270 

Ala Leu Lys Asn 
285 

Val Thr Phe Val 

Pro Ala Tyr Ala 

320 

Tyr Arg Asp Tyr 
335 

He Trp He His 
350 

Tyr Asp Ser Asp 
365 

Pro Gly Leu He 
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370 










375 










380 








Thr 


Tyr 


He 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 


Arg 


Trp 


Val Tyr 


Val 


385 








390 










395 








400 


Pro 


Lys 

** 


Phe 


Ala 


Gly 


Ala 


Cys 


He 


His 


Glu 


Tyr 


Thr 


Gly 


Asn Leu 


Gly 








405 










410 








415 




Gly 


Trp 


Val 


Asp 
420 


Lys 


Tyr 


Val 


Tyr 


Ser 
425 


Ser 


Gly 


Trp 


Val 


Tyr Leu 
430 


Glu 


Ala 


Pro 


Ala 
435 


Tyr 


Asp 


Pro 


Ala 


Asn 
440 


Gly 


Gin 


Tyr 


Gly 


Tyr 
445 


Ser Val 


Trp 


Ser 


Tyr 
450 


Cys 


Gly 


Val 


Gly 


Ser 
455 


Glu 


Lys 


Asp 


Glu 


Leu 

460 









<210> 34 
<211> 825 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 34 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


Thr 


Ser 


Ala 


Gly 


His 


Trp 


Tyr 


Lys 


His 


Gin 


Arg 


Ala 


Tyr 


Gin 


Phe 








20 










25 










30 






Thr 


Gly 


Glu 


Asp 


Asp 


Phe 


Gly 


Lys 


Val 


Ala 


Val 


Val 


Lys 


Leu 


Pro 


Met 






35 










40 










45 








Asp 


Leu 


Thr 


Lys 


Val 


Gly 


He 


He 


Val 


Arg 


Leu 


Asn 


Glu 


Trp 


Gin 


Ala 




50 










55 










60 










Lys 


Asp 


Val 


Ala 


Lys 


Asp 


Arg 


Phe 


lie 


Glu 


He 


Lys 


Asp 


Gly 


Lys 


Ala 


65 










70 










75 










80 


Glu 


val 


Trp 


He 


Leu 


Gin 


Gly 


Val 


Glu 


Glu 


He 


Phe 


Tyr 


Glu 


Lys 


Pro 










85 










90 










95 




Asp 


Thr 


Ser 


Pro 


Arg 


He 


Phe 


Phe 


Ala 


Gin 


Ala 


Arg 


Ser 


Asn 


Lys 


Val 






100 










105 










110 






He 


Glu 


Ala 


Phe 


Leu 


Thr 


Asn 


Pro 


Val 


Asp 


Thr 


Lys 


Lys 


Lys 


Glu 


Leu 






115 










120 










125 








Phe 


Lys 


Val 


Thr 


val 


Asp 


Gly 


Lys 


Glu 


lie 


Pro 


Val 


Ser 


Arg 


Val 


Glu 




130 










135 










140 










Lys 


Ala 


Asp 


Pro 


Thr 


Asp 


He 


Asp 


Val 


Thr 


Asn 


Tyr 


Val 


Arg 


lie 


val 


145 










150 










155 










160 


Leu 


Ser 


Glu 


Ser 


Leu 


Lys 


Glu 


Glu 


Asp 


Leu 


Arg 


Lys 


Asp 


Val 


Glu 


Leu 










165 










170 










175 




He 


He 


Glu 


Gly 


Tyr 


Lys 


Pro 


Ala 


Arg 


Val 


He 


Met 


Met 


Glu 


He 


Leu 








180 










185 










190 






Asp 


Asp 


Tyr 


Tyr 


Tyr 


Asp 


Gly 


Glu 


Leu 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Glu 






195 










200 










205 








Lys 


Thr 


He 


Phe 


Arg 


Val 


Trp 


Ser 


Pro 


Val 


Ser 


Lys 


Trp 


Val 


Lys 


Val 


210 










215 










220 










Leu 


Leu 


Phe 


Lys 


Asn 


Gly 


Glu 


Asp 


Thr 


Glu 


Pro 


Tyr 


Gin 


Val 


Val 


Asn 


225 










230 










235 










240 


Met 


Glu 


Tyr 


Lys 


Gly 


Asn 


Gly 


Val 


Trp 


Glu 


Ala 


Val 


Val 


Glu 


Gly 


Asp 










245 










250 










255 





204 
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Leu 


Asp 


Gly 


Val 


Phe 


Tyr 


Leu 


Tyr 


Gin 


Leu 


Glu 


Asn 


Tyr 


Gly 


Lys 


He 






260 










265 










270 






Arg 


Thr 


Thr 
275 


Val 


Asp 


Pro 


Tyr 


Ser 
280 


Lys 


Ala 


Val 


Tyr 


Ala 
285 


Asn 


Asn 


Gin 


Glu 


Ser 
290 


Ala 


Val 


Val 


Asn 


Leu 
295 


Ala 


Arg 


Thr 


Asn 


Pro 
300 


Glu 


Gly 


Trp 


Glu 


Asn 


Asp 


Arg 


Gly 


Pro 


Lys 


He 


Glu 


Gly 


Tyr 


Glu 


Asp 


Ala 


He 


He 


Tyr 


305 






310 










315 










320 


Glu 


He 


His 


He 


Ala 

325 


Asp 


He 


Thr 


Gly 


Leu 

330 


Glu 


Asn 


Ser 


Gly 


Val 
335 


Lys 


Asn 


Lys 

mm 


Gly 


Leu 


Tyr 


Leu 


Gly 


Leu 


Thr 


Glu 


Glu 


Asn 


Thr 


Lys 


Ala 


Pro 






340 










345 










350 






Gly 


Gly 


Val 


Thr 


Thr 


Gly 


Leu 


Ser 


His 


Leu 


Val 


Glu 


Leu 


Gly 


Val 


Thr 


355 










360 










365 








His 


Val 
370 


His 


He 


Leu 


Pro 


Phe 

375 


Phe 


Asp 


Phe 


Tyr 


Thr 
380 


Gly 


Asp 


Glu 


Leu 


Asp 


Lys 

mi 


Asp 


Phe 


Glu 


Lys 


Tyr 


Tyr 


Asn 


Trp 


Gly 


Tyr 


Asp 


Pro 


Tyr 


Leu 


385 








390 










395 










400 


Phe 


Met 


Val 


Pro 


Glu 
405 


Gly 


Arg 


Tyr 


Ser 


Thr 
410 


Asp 


Pro 


Lys 


Asn 


Pro 
415 


His 


Thr 


Arg 


He 


Arg 


Glu 


Val 


Lys 


Glu 


Met 


Val 


Lys 


Ala 


Leu 


His 


Lys 


His 






420 










425 










430 






Gly 


He 


Gly 


Val 


He 


Met 


Asp 


Met 


Val 


Phe 


Pro 


His 


Thr 


Tyr 


Gly 


He 




435 










440 










445 








Gly 

-< 


Glu 


Leu 


Ser 


Ala 


Phe 


Asp 


Gin 


Thr 


Val 


Pro 


Tyr 


Tyr 


Phe 


Tyr 


Arg 


450 










455 










460 










lie 


Asp 


Lys 


Thr 


Gly 


Ala 


Tyr 


Leu 


Asn 


Glu 


Ser 


Gly 


Cys 


Gly 


Asn 


Val 


465 






470 










475 










480 


He 


Ala 


Ser 


Glu 


Arg 
485 


Pro 


Met 


Met 


Arg 


Lys 
490 


Phe 


He 


Val 


Asp 


Thr 
495 


Val 


Thr 


Tyr 


Trp 


Val 


Lys 


Glu 


Tyr 


His 


He 


Asp 


Gly 


Phe 


Arg 


Phe 


Asp 


Gin 






500 










505 










510 






Met 


Gly 


Leu 
515 


He 


Asp 


Lys 


Lys 


Thr 
520 


Met 


Leu 


Glu 


Val 


Glu 
525 


Arg 


Ala 


Leu 


His 


Lys 


He 


Asp 


Pro 


Thr 


He 


He 


Leu 


Tyr 


Gly 


Glu 


Pro 


Trp 


Gly 


Gly 




530 










535 










540 








His 


Trp 


Gly 


Ala 


Pro 


He 


Arg 


Phe 


Gly 


Lys 


Ser 


Asp 


Val 


Ala 


Gly 


Thr 


545 








550 










555 










560 


Val 


Ala 


Ala 


Phe 


Asn 
565 


Asp 


Glu 


Phe 


Arg 


Asp 
570 


Ala 


He 


Arg 


Gly 


Ser 
575 


Val 


Phe 


Asn 


Pro 


Ser 
580 


Val 


Lys 


Gly 


Phe 


Val 
585 


Met 


Gly 


Gly 


Tyr 


Gly 
590 


Lys 


Glu 


Thr 


Lys 


He 
595 


Lys 


Arg 


Gly 


Val 


Val 
600 


Gly 


Ser 


He 


Asn 


Tyr 
605 


Asp 


Gly 


Lys 


Leu 


He 


Lys 


Ser 


Phe 


Ala 


Leu 


Asp 


Pro 


Glu 


Glu 


Thr 


He 


Asn 


Tyr 


Ala 




610 








615 










620 










Ala 


Cys 


His 


Asp 


Asn 


His 


Thr 


Leu 


Trp 


Asp 


Lys 


Asn 


Tyr 


Leu 


Ala 


Ala 


625 






630 










635 










640 


Lys 


Ala 


Asp 


Lys 


Lys 


Lys 


Glu 


Trp 


Thr 


Glu 


Glu 


Glu 


Leu 


Lys 


Asn 


Ala 








645 










650 










655 




Gin 


Lys 


Leu 


Ala 


Gly 


Ala 


He 


Leu 


Leu 


Thr 


Ser 


Gin 


Gly 


Val 


Pro 


Phe 






660 










665 










670 






Leu 


His 


Gly 
675 


Gly 


Gin 


Asp 


Phe 


Cys 
680 


Arg 


Thr 


Thr 


Asn 


Phe 
685 


Asn 


Asp 


Asn 
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Ser Tyr Asn Ala Pro He Ser He Asn Gly Phe Asp Tyr Glu Arg Lys 

690 695 700 

Leu Gin Phe He Asp Val Phe Asn Tyr His Lys Gly Leu He Lys Leu 
705 710 715 720 

Arg Lys Glu His Pro Ala Phe Arg Leu Lys Asn Ala Glu Glu He Lys 

725 730 735 

Lys His Leu Glu Phe Leu Pro Gly Gly Arg Arg He Val Ala Phe Met 

740 745 750 

Leu Lys Asp His Ala Gly Gly Asp Pro Trp Lys Asp He Val Val He 

755 760 765 

Tyr Asn Gly Asn Leu Glu Lys Thr Thr Tyr Lys Leu Pro Glu Gly Lys 

770 775 780 

Trp Asn Val Val Val Asn Ser Gin Lys Ala Gly Thr Glu Val He Glu 
785 790 795 800 

Thr Val Glu Gly Thr He Glu Leu Asp Pro Leu Ser Ala Tyr Val Leu 

805 810 . 815 

Tyr Arg Glu Ser Glu Lys Asp Glu Leu 

820 825 



<210> 35 
<211> 460 
<212> PRT 
<213> Artificia 

<220> 

<223> synthetic 

<400> 35 
Met Arg Val Leu 
1 

Ala Thr Ser Ala 

20 

Gin Ala Phe Tyr 
35 

lie Arg Gin Lys 
50 

Trp He Pro Pro 
65 

Tyr Asp Pro Tyr 

Thr Val Glu Thr 

100 

Asn Thr Ala His 
115 

Asn His Arg Ala 
130 

Tyr Thr Trp Thr 
145 

Asn Tyr Leu Asp 

Thr Phe Gly Gly 

180 

Tyr Trp Leu Trp 



Sequence 



Leu 


Val 


Ala 


Leu 


5 








Lys 


Tyr 


Leu 


Glu 


Trp 


Asp 


s 

Val 


Pro 








40 


He 


Pro 


Glu 


Trp 






55 




Ala 


Ser 


Lys 


Gly 




70 






Asp 


Tyr 


Phe 


Asp 


85 








Arg 


Phe 


Gly 


Ser 


Ala 


Tyr 


Gly 


He 








120 


Gly 


Gly 


Asp 


Leu 






135 




Asp 


Phe 


Ser 


Lys 




150 






Phe 


His 


Pro 


Asn 


165 








Tyr 


Pro 


Asp 


He 


Ala 


Ser 


Gin 


Glu 



Ala Leu Leu Ala 
10 

Leu Glu Glu Gly 
25 

Ser Gly Gly He 

Tyr Asp Ala Gly 

60 

Met Ser Gly Gly 
75 

Leu Gly Glu Tyr 
90 

Lys Gin Glu Leu 
105 

Lys Val He Ala 

Glu Trp Asn Pro 

140 

Val Ala Ser Gly 
155 

Glu Leu His Ala 
170 

Cys His Asp Lys 
185 

Ser Tyr Ala Ala 
206 



Leu Ala Ala Ser 
15 

Gly Val He Met 
30 

Trp Trp Asp Thr 
45 

He Ser Ala He 

Tyr Ser Met Gly 

80 

Tyr Gin Lys Gly 
95 

lie Asn Met lie 
110 

Asp He Val He 
125 

Phe Val Gly Asp 

Lys Tyr Thr Ala 

160 

Gly Asp Ser Gly 
175 

Ser Trp Asp Gin 
190 

Tyr Leu Arg Ser 
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195 200 205 

He Gly He Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala 

210 215 220 

Trp Val Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly 
225 230 235 240 

Glu Tyr Trp Asp Thr Asn Val Asp Ala Leu Leu Asn Trp Ala Tyr Ser 

245 250 255 

Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Ala 

260 265 270 

Ala Phe Asp Asn Lys Asn He Pro Ala Leu Val Glu Ala Leu Lys Asn 

275 280 285 

Gly Gly Thr Val Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val 

290 295 300 

Ala Asn His Asp Thr Asp He lie Trp Asn Lys Tyr Pro Ala Tyr Ala 
305 310 315 320 

Phe He Leu Thr Tyr Glu Gly Gin Pro Thr He Phe Tyr Arg Asp Tyr 

325 330 335 

Glu Glu Trp Leu Asn Lys Asp Lys Leu Lys Asn Leu lie Trp He His 

340 345 350 

Asp Asn Leu Ala Gly Gly Ser Thr Ser lie Val Tyr Tyr Asp Ser Asp 

355 360 365 

Glu Met He Phe Val Arg Asn Gly Tyr Gly Ser Lys Pro Gly Leu lie 

370 375 380 

Thr Tyr He Asn Leu Gly Ser Ser Lys Val Gly Arg Trp Val Tyr Val 
385 390 395 400 

Pro Lys Phe Ala Gly Ala Cys lie His Glu Tyr Thr Gly Asn Leu Gly 

405 410 415 

Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu 

420 425 430 

Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp 

435 440 445 

Ser Tyr Cys Gly Val Gly Ser Glu Lys Asp Glu Leu 
450 455 460 



<210> 36 
<211> 718 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 36 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Met Glu Thr lie Lys lie Tyr Glu Asn Lys Gly Val Tyr 

20 25 30 

Lys Val Val lie Gly Glu Pro Phe Pro Pro He Glu Phe Pro Leu Glu 

35 40 45 

Gin Lys lie Ser Ser Asn Lys Ser Leu Ser Glu Leu Gly Leu Thr lie 

50 55 60 

Val Gin Gin Gly Asn Lys Val He Val Glu Lys Ser Leu Asp Leu Lys 
65 70 75 80 
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Glu His lie lie Gly Leu Gly Glu Lys Ala Phe Glu Leu Asp Arg Lys 

85 90 95 

Arg Lys Arg Tyr Val Met Tyr Asn Val Asp Ala Gly Ala Tyr Lys Lys 

100 105 110 

Tyr Gin Asp Pro Leu Tyr Val Ser lie Pro Leu Phe lie Ser Val Lys 

115 120 125 

Asp Gly Val Ala Thr Gly Tyr Phe Phe Asn Ser Ala Ser Lys Val lie 

130 135 140 

Phe Asp Val Gly Leu Glu Glu Tyr Asp Lys Val lie Val Thr lie Pro 
145 150 155 160 

Glu Asp Ser Val Glu Phe Tyr Val lie Glu Gly Pro Arg lie Glu Asp 

165 170 175 

Val Leu Glu Lys Tyr Thr Glu Leu Thr Gly Lys Pro Phe Leu Pro Pro 

180 185 190 

Met Trp Ala Phe Gly Tyr Met lie Ser Arg Tyr Ser Tyr Tyr Pro Gin 

195 200 205 

Asp Lys Val Val Glu Leu Val Asp lie Met Gin Lys Glu Gly Phe Arg 

210 215 220 

Val Ala Gly Val Phe Leu Asp lie His Tyr Met Asp Ser Tyr Lys Leu 
225 230 235 240 

Phe Thr Trp His Pro Tyr Arg Phe Pro Glu Pro Lys Lys Leu lie Asp 

245 250 255 

Glu Leu His Lys Arg Asn Val Lys Leu lie Thr lie Val Asp His Gly 

260 265 270 

lie Arg Val Asp Gin Asn Tyr Ser Pro Phe Leu Ser Gly Met Gly Lys 

275 280 285 

Phe Cys Glu lie Glu Ser Gly Glu Leu Phe Val Gly Lys Met Trp Pro 

290 295 300 

Gly Thr Thr Val Tyr Pro Asp Phe Phe Arg Glu Asp Thr Arg Glu Trp 
305 310 315 320 

Trp Ala Gly Leu lie Ser Glu Trp Leu Ser Gin Gly Val Asp Gly lie 

325 330 335 

Trp Leu Asp Met Asn Glu Pro Thr Asp Phe Ser Arg Ala lie Glu lie 

340 345 350 

Arg Asp Val Leu Ser Ser Leu Pro Val Gin Phe Arg Asp Asp Arg Leu 

355 360 365 

Val Thr Thr Phe Pro Asp Asn Val Val His Tyr Leu Arg Gly Lys Arg 

370 375 380 

Val Lys His Glu Lys Val Arg Asn Ala Tyr Pro Leu Tyr Glu Ala Met 
385 390 395 400 

Ala Thr Phe Lys Gly Phe Arg Thr Ser His Arg Asn Glu lie Phe He 

405 410 415 

Leu Ser Arg Ala Gly Tyr Ala Gly He Gin Arg Tyr Ala Phe He Trp 

420 425 430 

Thr Gly Asp Asn Thr Pro Ser Trp Asp Asp Leu Lys Leu Gin Leu Gin 

435 440 445 

Leu Val Leu Gly Leu Ser He Ser Gly Val Pro Phe Val Gly Cys Asp 

450 455 460 

He Gly Gly Phe Gin Gly Arg Asn Phe Ala Glu lie Asp Asn Ser Met 
465 470 475 480 

Asp Leu Leu Val Lys Tyr Tyr Ala Leu Ala Leu Phe Phe Pro Phe Tyr 

485 490 495 

Arg Ser His Lys Ala Thr Asp Gly He Asp Thr Glu Pro Val Phe Leu 

500 505 510 
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Pro 


Asp 


Tyr 
515 


Tyr 


Lys 


Glu 


Lys 


Val 
520 


Lys 


Glu 


He 


Val 


Glu 
525 


Leu 


Arg 


Tyr 


Lys 


Phe 


Leu 


Pro 


Tyr 


He 


Tyr 


Ser 


Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lys 


530 










535 










540 










Gly 


His 


Pro 


Val 


He 


Arg 


Pro 


Leu 


Phe 


Tyr 


Glu 


Phe 


Gin 


Asp 


Asp 


Asp 


545 










550 










555 










560 


Asp 


Met 


Tyr 


Arg 


He 
565 


Glu 


Asp 


Glu 


Tyr 


Met 
570 


Val 


Gly 


Lys 


Tyr 


Leu 
575 


Leu 


Tyr 


Ala 


Pro 


He 


Val 


Ser 


Lys 


Glu 


Glu 


Ser 


Arg 


Leu 


Val 


Thr 


Leu 


Pro 






580 










585 










590 






Arg 


Gly 


Lys 
595 


Trp 


Tyr 


Asn 


Tyr 


Trp 

600 


Asn 


Gly 


Glu 


He 


He 
605 


Asn 


Gly 


Lys 


Ser 


Val 


Val 


Lys 


Ser 


Thr 


His 


Glu 


Leu 


Pro 


He 


Tyr 


Leu 


Arg 


Glu 


Gly 




610 








615 










620 










Ser 


lie 


He 


Pro 


Leu 


Glu 


Gly 


Asp 


Glu 


Leu 


He 


Val 


Tyr 


Gly 


Glu 


Thr 


625 










630 










635 










640 


Ser 


Phe 


Lys 


Arg 


Tyr 


Asp 


Asn 


Ala 


Glu 


He 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 








645 










650 










655 




lie 


Lys 


Phe 


Ser 


Arg 


Glu 


He 


Tyr 


Val 


Ser 


Lys 


Leu 


Thr 


He 


Thr 


Ser 






660 










665 










670 






Glu 


Lys 


Pro 


Val 


Ser 


Lys 


He 


lie 


Val 


Asp 


Asp 


Ser 


Lys 


Glu 


He 


Gin 




675 










680 










685 








Va 1 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 


Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 




690 








695 










700 










lie 


Arg 


Gly 


Lys 


He 


Asn 


Leu 


Glu 


Ser 


Glu 


Lys 


Asp 


Glu 


Leu 







705 710 715 



<210> 37 
<211> 1434 
<212> DNA 

<213> Thermotoga maritima 
<400> 37 

atgaaagaaa ccgctgctgc taaattcgaa cgccagcaca tggacagccc agatctgggt 60 
accctggtgc cacgcggttc catggccgag ttcttcccgg agatcccgaa gatccagttc 120 
gagggcaagg agtccaccaa cccgctcgcc ttccgcttct acgacccgaa cgaggtgatc 180 
gacggcaagc cgctcaagga ccacctcaag ttctccgtgg ccttctggca caccttcgtg 240 
aacgagggcc gcgacccgtt cggcgacccg accgccgagc gcccgtggaa ccgcttctcc 300 
gacccgatgg acaaggcctt cgcccgcgtg gacgccctct tcgagttctg cgagaagctc 360 
aacatcgagt acttctgctt ccacgaccgc gacatcgccc cggagggcaa gaccctccgc 420 
gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 4 80 
aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 
gccgccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 
ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 
tacgagaccc tcctcaacac cgacctcggc ctggagctgg agaacctcgc ccgcttcctc 720 
cgcatggccg tggagtacgc caagaagatc ggcttcaccg gccagttcct catcgagccg 780 
aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 840 
ctcaagaacc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 
ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 
tccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 
aacatctacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 
ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 
atcggccaca tcgccggcat ggacaccttc gccctcggct tcaagatcgc ctacaagctc 1200 
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gccaaggacg gcgtgttcga caagttcatc gaggagaagt accgctcctt caaggagggc 1260 
atcggcaagg agatcgtgga gggcaagacc gacttcgaga agctggagga gtacatcatc 1320 
gacaaggagg acatcgagct gccgtccggc aagcaggagt acctggagtc cctcctcaac 13 80 
tcctacatcg tgaagaccat cgccgagctg cgctccgaga aggacgagct gtga 1434 

<210> 38 
<211> 477 
<212> PRT 

<213> Thermotoga maritima 

<400> 38 

Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gin His Met Asp Ser 

15 10 15 

Pro Asp Leu Gly Thr Leu Val Pro Arg Gly Ser Met Ala Glu Phe Phe 

20 25 30 

Pro Glu lie Pro Lys He Gin Phe Glu Gly Lys Glu Ser Thr Asn Pro 

35 40 45 

Leu Ala Phe Arg Phe Tyr Asp Pro Asn Glu Val He Asp Gly Lys Pro 

50 55 60 

Leu Lys Asp His Leu Lys Phe Ser Val Ala Phe Trp His Thr Phe Val 
65 70 75 80 

Asn Glu Gly Arg Asp Pro Phe Gly Asp Pro Thr Ala Glu Arg Pro Trp 

85 90 95 

Asn Arg Phe Ser Asp Pro Met Asp Lys Ala Phe Ala Arg Val Asp Ala 

100 105 110 

Leu Phe Glu Phe Cys Glu Lys Leu Asn He Glu Tyr Phe Cys Phe His 

115 120 125 

Asp Arg Asp He Ala Pro Glu Gly Lys Thr Leu Arg Glu Thr Asn Lys 

130 135 140 

He Leu Asp Lys Val Val Glu Arg He Lys Glu Arg Met Lys Asp Ser 
145 150 155 160 

Asn Val Lys Leu Leu Trp Gly Thr Ala Asn Leu Phe Ser His Pro Arg 

165 170 175 

Tyr Met His Gly Ala Ala Thr Thr Cys Ser Ala Asp Val Phe Ala Tyr 

180 185 190 

Ala Ala Ala Gin Val Lys Lys Ala Leu Glu He Thr Lys Glu Leu Gly 

195 200 205 

Gly Glu Gly Tyr Val Phe Trp Gly Gly Arg Glu Gly Tyr Glu Thr Leu 

210 215 220 

Leu Asn Thr Asp Leu Gly Leu Glu Leu Glu Asn Leu Ala Arg Phe Leu 
225 230 235 240 

Arg Met Ala Val Glu Tyr Ala Lys Lys He Gly Phe Thr Gly Gin Phe 

245 250 255 

Leu He Glu Pro Lys Pro Lys Glu Pro Thr Lys His Gin Tyr Asp Phe 

260 265 270 

Asp Val Ala Thr Ala Tyr Ala Phe Leu Lys Asn His Gly Leu Asp Glu 

275 280 285 

Tyr Phe Lys Phe Asn He Glu Ala Asn His Ala Thr Leu Ala Gly His 

290 295 300 

Thr Phe Gin His Glu Leu Arg Met Ala Arg He Leu Gly Lys Leu Gly 
305 310 315 320 

Ser He Asp Ala Asn Gin Gly Asp Leu Leu Leu Gly Trp Asp Thr Asp 

325 330 335 

Gin Phe Pro Thr Asn He Tyr Asp Thr Thr Leu Ala Met Tyr Glu Val 
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340 










345 










350 






lie 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 


Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 






355 










360 










365 








Val 


Arg 


Arg 


Ala 


Ser 


Tyr 


Lys 


Val 


Glu 


Asp 


Leu 


Phe 


He 


Gly 


His 


He 




370 










375 










380 










Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 


He 


Ala 


Tyr 


Lys 


Leu 


385 








390 










395 










400 


Ala 


Lys 


Asp 


Gly 


Val 


Phe 


Asp 


Lys 


Phe 


lie 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 










405 










410 










415 




Phe 


Lys 


Glu 


Gly 


He 


Gly 


Lys 


Glu 


He 


Val 


Glu 


Gly 


Lys 


Thr 


Asp 


Phe 








420 










425 










430 






Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 


Lys 


Glu 


Asp 


He 


Glu 


Leu 


Pro 






435 










440 










445 








Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


Leu 


Asn 


Ser 


Tyr 


He 


Val 




450 










455 










460 










Lys 


Thr 


He 


Ala 


Glu 


Leu 


Arg 


Ser 


Glu 


Lys 


Asp 


Glu 


Leu 








465 










470 










475 













<210> 39 
<211> 1434 
<212> DNA 

<213> Thermotoga neapolitana 
<400> 39 

atgaaagaaa ccgctgctgc taaattcgaa cgccagcaca tggacagccc agatctgggt 60 
accctggtgc cacgcggttc catggccgag ttcttcccgg agatcccgaa ggtgcagttc 120 
gagggcaagg agtccaccaa cccgctcgcc ttcaagttct acgacccgga ggagatcatc 180 
gacggcaagc cgctcaagga ccacctcaag ttctccgtgg ccttctggca caccttcgtg 240 
aacgagggcc gcgacccgtt cggcgacccg accgccgacc gcccgtggaa ccgctacacc 3 00 
gacccgatgg acaaggcctt cgcccgcgtg gacgccctct tcgagttctg cgagaagctc 360 
aacatcgagt acttctgctt ccacgaccgc gacatcgccc cggagggcaa gaccctccgc 420 
gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 4 80 
aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 
gccgccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 
ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 
tacgagaccc tcctcaacac cgacctcggc ttcgagctgg agaacctcgc ccgcttcctc 720 
cgcatggccg tggactacgc caagcgcatc ggcttcaccg gccagttcct catcgagccg 780 
aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 840 
ctcaagtccc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 
ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 
tccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 
aacgtgtacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 
ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 
atcggccaca tcgccggcat ggacaccttc gccctcggct tcaaggtggc ctacaagctc 1200 
gtgaaggacg gcgtgctcga caagttcatc gaggagaagt accgctcctt ccgcgagggc 1260 
atcggccgcg acatcgtgga gggcaaggtg gacttcgaga agctggagga gtacatcatc 1320 
gacaaggaga ccatcgagct gccgtccggc aagcaggagt acctggagtc cctcatcaac 1380 
tcctacatcg tgaagaccat cctggagctg cgctccgaga aggacgagct gtga 1434 

<210> 40 
<211> 477 
<212> PRT 

<213> Thermotoga neapolitana 
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<400> 40 

Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gin His Met Asp Ser 

15 10 15 

Pro Asp Leu Gly Thr Leu Val Pro Arg Gly Ser Met Ala Glu Phe Phe 

20 25 30 

Pro Glu lie Pro Lys Val Gin Phe Glu Gly Lys Glu Ser Thr Asn Pro 

35 40 45 

Leu Ala Phe Lys Phe Tyr Asp Pro Glu Glu He He Asp Gly Lys Pro 

50 55 60 

Leu Lys Asp His Leu Lys Phe Ser Val Ala Phe Trp His Thr Phe Val 
65 70 75 80 

Asn Glu Gly Arg Asp Pro Phe Gly Asp Pro Thr Ala Asp Arg Pro Trp 

85 90 95 

Asn Arg Tyr Thr Asp Pro Met Asp Lys Ala Phe Ala Arg Val Asp Ala 

100 105 110. 

Leu Phe Glu Phe Cys Glu Lys Leu Asn He Glu Tyr Phe Cys Phe His 

115 120 125 

Asp Arg Asp He Ala Pro Glu Gly Lys Thr Leu Arg Glu Thr Asn Lys 

130 135 140 

He Leu Asp Lys Val Val Glu Arg He Lys Glu Arg Met Lys Asp Ser 
145 150 155 160 

Asn Val Lys Leu Leu Trp Gly Thr Ala Asn Leu Phe Ser His Pro Arg 

165 170 175 

Tyr Met His Gly Ala Ala Thr Thr Cys Ser Ala Asp Val Phe Ala Tyr 

180 185 190 

Ala Ala Ala Gin Val Lys Lys Ala Leu Glu He Thr Lys Glu Leu Gly 

195 200 205 

Gly Glu Gly Tyr Val Phe Trp Gly Gly Arg Glu Gly Tyr Glu Thr Leu 

210 215 220 

Leu Asn Thr Asp Leu Gly Phe Glu Leu Glu Asn Leu Ala Arg Phe Leu 
225 230 235 240 

Arg Met Ala Val Asp Tyr Ala Lys Arg He Gly Phe Thr Gly Gin Phe 

245 250 255 

Leu He Glu Pro Lys Pro Lys Glu Pro Thr Lys His Gin Tyr Asp Phe 

260 265 270 

Asp Val Ala Thr Ala Tyr Ala Phe Leu Lys Ser His Gly Leu Asp Glu 

275 280 285 

Tyr Phe Lys Phe Asn He Glu Ala Asn His Ala Thr Leu Ala Gly His 

290 295 300 

Thr Phe Gin His Glu Leu Arg Met Ala Arg He Leu Gly Lys Leu Gly 
305 310 315 320 

Ser He Asp Ala Asn Gin Gly Asp Leu Leu Leu Gly Trp Asp Thr Asp 

325 330 335 

Gin Phe Pro Thr Asn Val Tyr Asp Thr Thr Leu Ala Met Tyr Glu Val 

340 345 350 

He Lys Ala Gly Gly Phe Thr Lys Gly Gly Leu Asn Phe Asp Ala Lys 

355 360 365 

Val Arg Arg Ala Ser Tyr Lys Val Glu Asp Leu Phe He Gly His He 

370 375 380 

Ala Gly Met Asp Thr Phe Ala Leu Gly Phe Lys Val Ala Tyr Lys Leu 
385 390 395 400 

Val Lys Asp Gly Val Leu Asp Lys Phe He Glu Glu Lys Tyr Arg Ser 

405 410 415 
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Phe Arg Glu Gly lie Gly Arg Asp lie Val Glu Gly Lys Val Asp Phe 

420 425 430 

Glu Lys Leu Glu Glu Tyr lie lie Asp Lys Glu Thr lie Glu Leu Pro 

435 440 445 

Ser Gly Lys Gin Glu Tyr Leu Glu Ser Leu lie Asn Ser Tyr lie Val 

450 455 460 

Lys Thr lie Leu Glu Leu Arg Ser Glu Lys Asp Glu Leu 
465 470 475 



<210> 41 
<211> 1435 
<212> DNA 

<213> Thermotoga maritima 

<400> 41 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 
atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 
gagatcccga agatccagtt cgagggcaag gagtccacca acccgctcgc cttccgcttc 180 
tacgacccga acgaggtgat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 
gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgag 300 
cgcccgtgga accgcttctc cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 3 60 
ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 
cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 
aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 
acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 
ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 
tctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ctggagctgg 72 0 
agaacctcgc ccgcttcctc cgcatggccg tggagtacgc caagaagatc ggcttcaccg 780 
gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gcttcgacgt 84 0 
ggccaccgcc tacgccttcc tcaagaacca cggcctcgac gagtacttca agttcaacat 900 
cgaggccaac cacgccaccc tcgccggcca caccttccag cacgagctgc gcatggcccg 960 
catcctcggc aagctcggct ccatcgacgc caaccagggc gacctcctcc tcggctggga 1020 
caccgaccag ttcccgacca acatctacga caccaccctc gccatgtacg aggtgatcaa 1080 
ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc aaggtgcgcc gcgcctccta 1140 
caaggtggag gacctcttca tcggccacat cgccggcatg gacaccttcg ccctcggctt 1200 
caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac aagttcatcg aggagaagta 1260 
ccgctccttc aaggagggca tcggcaagga gatcgtggag ggcaagaccg acttcgagaa 1320 
gctggaggag tacatcatcg acaaggagga catcgagctg ccgtccggca agcaggagta 1380 
cctggagtcc ctcctcaact cctacatcgt gaagaccatc gccgagctgc gctga 1435 

<210> 42 
<211> 478 
<212> PRT 

<213> Thermotoga maritima 
<400> 42 

Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

15 10 15 

Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

20 25 30 

He Pro Met Ala Glu Phe Phe Pro Glu He Pro Lys He Gin Phe Glu 

35 40 45 

Gly Lys Glu Ser Thr Asn Pro Leu Ala Phe Arg Phe Tyr Asp Pro Asn 
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50 










55 






Glu 


Val 


He 


Asp 


Gly 


Lys 


Pro 


Leu 


Lys 


65 










70 








Ala 


Phe 


Trp 

* It 


His 


Thr 


Phe 


Val 


Asn 


Glu 










85 










Pro 


Thr 


Ala 


Glu 


Arq 


Pro 


Trp 


Asn 


Arg 








100 










105 


Ala 


Phe 


Ala 


Arq 


Val 


Asp 


Ala 


Leu 


Phe 






115 










120 




lie 


Glu 


Tvr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 




130 










135 






Thr 


Leu 


Ara 


Glu 


Thr 


Asn 


Lys 


He 


Leu 


145 










150 








Ly s 


Glu 


Ara 


Met 


Lys 


Asp 


Ser 


Asn 


Val 










165 










Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 








180 










185 


Se r 


Ala 


Asd 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 






195 










200 




Glu 


He 

-L- JU V-# 


Thr 


Lvs 


Glu 


Leu 


Gly 


Gly 


Glu 




210 










215 






Ara 

X X -A— ^ < 


Glu 


Glv 


Tvr 


Glu 


Thr 


Leu 


Leu 


Asn 


225 










230 








Glu 


Asn 


Leu 


Ala 


Arq 


Phe 


Leu 


Arg 


Met 










245 










lie 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 








260 










265 


Thr 


Lvs 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 






275 










280 




Lvs 


Asn 


His 


Gly 


Leu 


Asp 


Glu 


Tyr 


Phe 




290 










295 






His 


Ala 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


305 










310 








Ara 


He 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 










325 










Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 








340 










345 


Thr 


Leu 


Ala 


Met 


Tyr 


Glu 


Val 


He 


Lys 






355 










360 




Glv 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arq 




370 










375 






Asp 


Leu 


Phe 


He 


Gly 


His 


He 


Ala 


Gly 


385 










390 








Phe 


Lys 


He 


Ala 


Tyr 


Lys 


Leu 


Ala 


Lys 










405 










He 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Lys 








420 










425 


Val 


Glu 


Gly 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 






435 










440 




Lys 


Glu 


Asp 


He 


Glu 


Leu 


Pro 


Ser 


Gly 




450 










455 






Leu 


Leu 


Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 
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60 



Asp 


His 


Leu 


Lys 


Phe 


Ser 


Val 




75 










80 


Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


90 










95 




Phe 


Ser 


Asp 


Pro 


Met 


Asp 


Lys 










110 






Glu 


Phe 


Cys 


Glu 


Lys 


Leu 


Asn 








125 








Asp 


He 


Ala 


Pro 


Glu 


Gly 


Lys 






140 










Asp 


Lys 


Val 


Val 


Glu 


Arg 


He 




155 










160 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


Ala 


170 










175 




His 


Gly 


Ala 


Ala 


Thr 


Thr 


Cys 










190 






Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 








205 








Gly 


Tyr 


Val 


Phe 


Trp 


Gly 


Gly 






220 










Thr 


Asp 


Leu 


Gly 


Leu 


Glu 


Leu 




235 










240 


Ala 


Val 


Glu 


Tyr 


Ala 


Lys 


Lys 


250 










255 




Glu 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 










270 






Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 








285 








Lys 


Phe 


Asn 


He 


Glu 


Ala 


Asn 






300 










Gin 


His 


Glu 


Leu 


Arg 


Met 


Ala 




315 










320 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


Leu 


330 










335 




Pro 


Thr 


Asn 


He 


Tyr 


Asp 


Thr 










350 






Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 








365 








Arg 


Ala 


Ser 


Tyr 


Lys 


Val 


Glu 






380 










Met 


Asp 


Thr 


Phe 


Ala 


Leu 


Gly 




395 










400 


Asp 


Gly 


Val 


Phe 


Asp 


Lys 


Phe 


410 










415 




Glu 


Gly 


He 


Gly 


Lys 


Glu 


He 










430 






Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 








445 








Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 






460 










He 


Ala 


Glu 


Leu 


Arg 







475 
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<210> 43 
<211> 1436 
<212> DNA 

<213> Thermotoga neapolitana 



<400> 43 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 
atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 
gagatcccga aggtgcagtt cgagggcaag gagtccacca acccgctcgc cttcaagttc 180 
tacgacccgg aggagatcat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 
gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgac 300 
cgcccgtgga accgctacac cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 
ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 
cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 
aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 
acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 
ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 
tctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ttcgagctgg 720 
agaacctcgc ccgcttcctc cgcatggccg tggactacgc caagcgcatc ggcttcaccg 780 
gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gacttcgacg 840 
tggccaccgc ctacgccttc ctcaagtccc acggcctcga cgagtacttc aagttcaaca 900 
tcgaggccaa ccacgccacc ctcgccggcc acaccttcca gcacgagctg cgcatggccc 960 
gcatcctcgg caagctcggc tccatcgacg ccaaccaggg cgacctcctc ctcggctggg 1020 
acaccgacca gttcccgacc aacgtgtacg acaccaccct cgccatgtac gaggtgatca 1080 
aggccggcgg cttcaccaag ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct 1140 
acaaggtgga ggacctcttc atcggccaca tcgccggcat ggacaccttc gccctcggct 1200 
tcaaggtggc ctacaagctc gtgaaggacg gcgtgctcga caagttcatc gaggagaagt 1260 
accgctcctt ccgcgagggc atcggccgcg acatcgtgga gggcaaggtg gacttcgaga 1320 
agctggagga gtacatcatc gacaaggaga ccatcgagct gccgtccggc aagcaggagt 13 80 
acctggagtc cctcatcaac tcctacatcg tgaagaccat cctggagctg cgctga 1436 



<210> 44 
<211> 478 
<212> PRT 

<213> Thermotoga neapolitana 



<400> 44 
Met Gly Ser Ser 
1 

Arg Gly Ser His 

20 

lie Pro Met Ala 
35 

Gly Lys Glu Ser 
50 

Glu lie He Asp 
65 

Ala Phe Trp His 

Pro Thr Ala Asp 

100 

Ala Phe Ala Arg 
115 



His His His His 

5 

Met Ala Ser Met 

Glu Phe Phe Pro 

40 

Thr Asn Pro Leu 
55 

Gly Lys Pro Leu 
70 

Thr Phe Val Asn 
85 

Arg Pro Trp Asn 

Val Asp Ala Leu 

120 



His 


His 


Ser 


Ser 




10 






Thr 


Gly 


Gly 


Gin 


25 








Glu 


He 


Pro 


Lys 


Ala 


Phe 


Lys 


Phe 








60 


Lys 


Asp 


His 


Leu 






75 




Glu 


Gly 


Arg 


Asp 




90 






Arg 


Tyr 


Thr 


Asp 


105 








Phe 


Glu 


Phe 


Cys 



Gly Leu Val Pro 
15 

Gin Met Gly Arg 
30 

Val Gin Phe Glu 
45 

Tyr Asp Pro Glu 

Lys Phe Ser Val 

80 

Pro Phe Gly Asp 
95 

Pro Met Asp Lys 
110 

Glu Lys Leu Asn 
125 
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He 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 


Asp 


He 


Ala 


Pro 


Glu 


Gly 


Lys 




130 










135 










140 










Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 


He 


Leu 


Asp 


Lys 


val 


Val 


Glu 


Arg 


He 


145 








150 










155 










160 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


Ala 








165 










170 










175 




Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 


His 


Gly 


Ala 


Ala 


Thr 


Thr 


Cys 








180 










185 










190 






Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 






195 










200 










205 








Glu 


He 


Thr 


Lys 


Glu 


Leu 


Gly 


Gly 


Glu 


Gly 


Tyr 


val 


Phe 


Trp 


Gly 


Gly 




210 










215 










220 










Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Phe 


Glu 


Leu 


225 






230 










235 










240 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 


Val 


Asp 


Tyr 


Ala 


Lys 


Arg 










245 










250 










255 




He 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 






260 










265 










270 






Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 






275 










280 










285 








Lys 


Ser 


His 


Gly 


Leu 


Asp 


Glu 


Tyr 


Phe 


Lys 


Phe 


Asn 


He 


Glu 


Ala 


Asn 


290 










295 










300 










His 


Ala 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 


His 


Glu 


Leu 


Arg 


Met 


Ala 


305 










310 










315 










320 


Arg 


He 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


Leu 








325 










330 










335 




Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


Thr 


Asn 


Val 


Tyr 


Asp 


Thr 






340 










345 










350 






Thr 


Leu 


Ala 


Met 


Tyr 


Glu 


Val 


He 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 






355 










360 










365 








Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arg 


Arg 


Ala 


Ser 


Tyr 


Lys 


Val 


Glu 


370 










375 










380 










Asp 


Leu 


Phe 


He 


Gly 


His 


He 


Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 


Gly 


385 










390 










395 










400 


Phe 


Lys 


Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Leu 


Asp 


Lys 


Phe 








405 










410 










415 




He 


Glu 


Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Arg 


Glu 


Gly 


He 


Gly 


Arg 


Asp 


He 








420 










425 










430 






Val 


Glu 


Gly 


Lys 


Val 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 






435 










440 










445 








Lys 


Glu 


Thr 


He 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


450 










455 










460 










Leu 


He 


Asn 


Ser 


Tyr 


He 


Val 


Lys 


Thr 


He 


Leu 


Glu 


Leu 


Arg 







465 470 475 



<210> 45 
<211> 1095 
<212> PRT 

<213> Aspergillus shirousami 
<400> 45 

Ala Thr Pro Ala Asp Trp Arg Ser 
1 5 



Gin Ser He Tyr Phe Leu Leu Thr 
10 15 
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Asp 


Arg 


Phe 


Ala 

20 


Arg 


Thr 


Asp 


Gly 


Ser 
25 


Thr 


Thr 


Ala 


Thr 


Cys 
30 


Asn 


Thr 


Ala 


Asp 


Gin 


Lys 


Tyr 


Cys 


Gly 


Gly 


Thr 


Trp 


Gin 


Gly 


He 


He 


Asp 


Lys 




35 










40 










45 








Leu 


Asp 
50 


Tyr 


He 


Gin 


Gly 


Met 
55 


Gly 


Phe 


Thr 


Ala 


He 
60 


Trp 


He 


Thr 


Pro 


Val 


Thr 


Ala 


Gin 


Leu 


Pro 


Gin 


Thr 


Thr 


Ala 


Tyr 


Gly 


Asp 


Ala 


Tyr 


His 


65 










70 










75 










80 


Gly 


Tyr 


Trp 


Gin 


Gin 
85 


Asp 


He 


Tyr 


Ser 


Leu 
90 


Asn 


Glu 


Asn 


Tyr 


Gly 
95 


Thr 


Ala 


Asp 


Asp 


Leu 


Lys 


Ala 


Leu 


Ser 


Ser 


Ala 


Leu 


His 


Glu 


Arg 


Gly 


Met 






100 










105 










110 






Tyr 


Leu 


Met 


Val 


Asp 


Val 


Val 


Ala 


Asn 


His 


Met 


Gly 


Tyr 


Asp 


Gly 


Ala 




115 










120 










125 








Gly 


Ser 
130 


Ser 


Val 


Asp 


Tyr 


Ser 
135 


Val 


Phe 


Lys 


Pro 


Phe 
140 


Ser 


Ser 


Gin 


Asp 


Tyr 


Phe 


His 


Pro 


Phe 


Cys 


Phe 


He 


Gin 


Asn 


Tyr 


Glu 


Asp 


Gin 


Thr 


Gin 


145 










150 










155 










160 


Val 


Glu 


Asp 


Cys 


Trp 
165 


Leu 


Gly 


Asp 


Asn 


Thr 
170 


Val 


Ser 


Leu 


Pro 


Asp 
175 


Leu 


Asp 


Thr 


Thr 


Lys 


Asp 


Val 


Val 


Lys 


Asn 


Glu 


Trp 


Tyr 


Asp 


Trp 


Val 


Gly 






180 










185 










190 






Ser 


Leu 


Val 
195 


Ser 


Asn 


Tyr 


Ser 


He 
200 


Asp 


Gly 


Leu 


Arg 


He 
205 


Asp 


Thr 


Val 


Lys 


His 


Val 


Gin 


Lys 


Asp 


Phe 


Trp 


Pro 


Gly 


Tyr 


Asn 


Lys 


Ala 


Ala 


Gly 


210 










215 










220 










Val 


Tyr 


Cys 


He 


Gly 


Glu 


Val 


Leu 


Asp 


Val 


Asp 


Pro 


Ala 


Tyr 


Thr 


Cys 


225 








230 










235 










240 


Pro 


Tyr 


Gin 


Asn 


Val 


Met 


Asp 


Gly 


Val 


Leu 


Asn 


Tyr 


Pro 


He 


Tyr 


Tyr 








245 










250 










255 




Pro 


Leu 


Leu 


Asn 
260 


Ala 


Phe 


Lys 


Ser 


Thr 
265 


Ser 


Gly 


Ser 


Met 


Asp 
270 


Asp 


Leu 


Tyr 


Asn 


Met 


He 


Asn 


Thr 


Val 


Lys 


Ser 


Asp 


Cys 


Pro 


Asp 


Ser 


Thr 


Leu 




275 










280 










285 








Leu 


Gly 
290 


Thr 


Phe 


val 


Glu 


Asn 
295 


His 


Asp 


Asn 


Pro 


Arg 
300 


Phe 


Ala 


Ser 


Tyr 


Thr 


Asn 


Asp 


He 


Ala 


Leu 


Ala 


Lys 


Asn 


Val 


Ala 


Ala 


Phe 


He 


He 


Leu 


305 








310 










315 










320 


Asn 


Asp 


Gly 


He 


Pro 
325 


He 


He 


Tyr 


Ala 


Gly 
330 


Gin 


Glu 


Gin 


His 


Tyr 
335 


Ala 


Gly 


Gly 


Asn 


Asp 


Pro 


Ala 


Asn 


Arg 


Glu 


Ala 


Thr 


Trp 


Leu 


Ser 


Gly 


Tyr 






340 










345 










350 






Pro 


Thr 


Asp 
355 


Ser 


Glu 


Leu 


Tyr 


Lys 
360 


Leu 


He 


Ala 


Ser 


Ala 
365 


Asn 


Ala 


He 


Arg 


Asn 


Tyr 


Ala 


He 


Ser 


Lys 


Asp 


Thr 


Gly 


Phe 


Val 


Thr 


Tyr 


Lys 


Asn 


370 










375 










380 










Trp 


Pro 


He 


Tyr 


Lys 


Asp 


Asp 


Thr 


Thr 


He 


Ala 


Met 


Arg 


Lys 


Gly 


Thr 


385 










390 










395 










400 


Asp 


Gly 


Ser 


Gin 


He 


Val 


Thr 


He 


Leu 


Ser 


Asn 


Lys 


Gly 


Ala 


Ser 


Gly 






405 










410 










415 




Asp 


Ser 


Tyr 


Thr 


Leu 


Ser 


Leu 


Ser 


Gly 


Ala 


Gly 


Tyr 


Thr 


Ala 


Gly 


Gin 






420 










425 










430 






Gin 


Leu 


Thr 
435 


Glu 


Val 


He 


Gly 


Cys 
440 


Thr 


Thr 


Val 


Thr 


Val 
445 


Gly 


Ser 


Asp 
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Gly Asn Val Pro Val Pro Met Ala Gly Gly Leu Pro Arg Val Leu Tyr 

450 455 460 

Pro Thr Glu Lys Leu Ala Gly Ser Lys lie Cys Ser Ser Ser Lys Pro 
465 470 475 480 

Ala Thr Leu Asp Ser Trp Leu Ser Asn Glu Ala Thr Val Ala Arg Thr 

485 490 495 

Ala He Leu Asn Asn He Gly Ala Asp Gly Ala Trp Val Ser Gly Ala 

500 505 510 

Asp Ser Gly He Val Val Ala Ser Pro Ser Thr Asp Asn Pro Asp Tyr 

515 520 525 

Phe Tyr Thr Trp Thr Arg Asp Ser Gly He Val Leu Lys Thr Leu Val 

530 535 540 

Asp Leu Phe Arg Asn Gly Asp Thr Asp Leu Leu Ser Thr He Glu His 
545 550 555 560 

Tyr He Ser Ser Gin Ala He He Gin Gly Val Ser Asn Pro Ser Gly 

565 570 575 

Asp Leu Ser Ser Gly Gly Leu Gly Glu Pro Lys Phe Asn Val Asp Glu 

580 585 590 

Thr Ala Tyr Ala Gly Ser Trp Gly Arg Pro Gin Arg Asp Gly Pro Ala 

595 600 605 

Leu Arg Ala Thr Ala Met He Gly Phe Gly Gin Trp Leu Leu Asp Asn 

610 615 620 

Gly Tyr Thr Ser Ala Ala Thr Glu He Val Trp Pro Leu Val Arg Asn 
625 630 635 640 

Asp Leu Ser Tyr Val Ala Gin Tyr Trp Asn Gin Thr Gly Tyr Asp Leu 

645 650 655 

Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr He Ala Val Gin His 

660 665 670 

Arg Ala Leu Val Glu Gly Ser Ala Phe Ala Thr Ala Val Gly Ser Ser 

675 680 685 

Cys Ser Trp Cys Asp Ser Gin Ala Pro Gin He Leu Cys Tyr Leu Gin 

690 695 700 

Ser Phe Trp Thr Gly Ser Tyr He Leu Ala Asn Phe Asp Ser Ser Arg 
705 710 715 720 

Ser Gly Lys Asp Thr Asn Thr Leu Leu Gly Ser He His Thr Phe Asp 

725 730 735 

Pro Glu Ala Gly Cys Asp Asp Ser Thr Phe Gin Pro Cys Ser Pro Arg 

740 745 750 

Ala Leu Ala Asn His Lys Glu Val Val Asp Ser Phe Arg Ser He Tyr 

755 760 765 

Thr Leu Asn Asp Gly Leu Ser Asp Ser Glu Ala Val Ala Val Gly Arg 

770 775 780 

Tyr Pro Glu Asp Ser Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Cys Thr 
785 790 795 800 

Leu Ala Ala Ala Glu Gin Leu Tyr Asp Ala Leu Tyr Gin Trp Asp Lys 

805 810 815 

Gin Gly Ser Leu Glu He Thr Asp Val Ser Leu Asp Phe Phe Lys Ala 

820 825 830 

Leu Tyr Ser Gly Ala Ala Thr Gly Thr Tyr Ser Ser Ser Ser Ser Thr 

835 840 845 

Tyr Ser Ser He Val Ser Ala Val Lys Thr Phe Ala Asp Gly Phe Val 

850 855 860 

Ser He Val Glu Thr His Ala Ala Ser Asn Gly Ser Leu Ser Glu Gin 
865 870 875 880 
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Phe Asp Lys Ser Asp Gly Asp Glu Leu Ser Ala Arg Asp Leu Thr Trp 

885 890 895 

Ser Tyr Ala Ala Leu Leu Thr Ala Asn Asn Arg Arg Asn Ser Val Val 

900 905 910 

Pro Pro Ser Trp Gly Glu Thr Ser Ala Ser Ser Val Pro Gly Thr Cys 

915 920 925 

Ala Ala Thr Ser Ala Ser Gly Thr Tyr Ser Ser Val Thr Val Thr Ser 

930 935 940 

Trp Pro Ser lie Val Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr 
945 950 955 960 

Thr Gly Ser Gly Gly Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala 

965 970 975 

Ser Lys Thr Ser Thr Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr 

980 985 990 

Ala Val Ala Val Thr Phe Asp Leu Thr Ala Thr Thr Thr Tyr Gly Glu 

995 1000 1005 

Asn lie Tyr Leu Val Gly Ser lie Ser Gin Leu Gly Asp Trp Glu Thr 

1010 1015 1020 

Ser Asp Gly lie Ala Leu Ser Ala Asp Lys Tyr Thr Ser Ser Asn Pro 
1025 1030 1035 1040 

Pro Trp Tyr Val Thr Val Thr Leu Pro Ala Gly Glu Ser Phe Glu Tyr 

1045 1050 1055 

Lys Phe lie Arg Val Glu Ser Asp Asp Ser Val Glu Trp Glu Ser Asp 

1060 1065 1070 

Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala Cys Gly Glu Ser Thr Ala 

1075 1080 1085 

Thr Val Thr Asp Thr Trp Arg 
1090 1095 



<210> 46 
<211> 3285 
<212> DNA 

<213> Aspergillus shirousami 
<400> 46 

gccaccccgg ccgactggcg ctcccagtcc atctacttcc tcctcaccga ccgcttcgcc 60 
cgcaccgacg gctccaccac cgccacctgc aacaccgccg accagaagta ctgcggcggc 120 
acctggcagg gcatcatcga caagctcgac tacatccagg gcatgggctt caccgccatc 180 
tggatcaccc cggtgaccgc ccagctcccg cagaccaccg cctacggcga cgcctaccac 240 
ggctactggc agcaggacat ctactccctc aacgagaact acggcaccgc cgacgacctc 300 
aaggccctct cctccgccct ccacgagcgc ggcatgtacc tcatggtgga cgtggtggcc 360 
aaccacatgg gctacgacgg cgccggctcc tccgtggact actccgtgtt caagccgttc 420 
tcctcccagg actacttcca cccgttctgc ttcatccaga actacgagga ccagacccag 480 
gtggaggact gctggctcgg cgacaacacc gtgtccctcc cggacctcga caccaccaag 540 
gacgtggtga agaacgagtg gtacgactgg gtgggctccc tcgtgtccaa ctactccatc 600 
gacggcctcc gcatcgacac cgtgaagcac gtgcagaagg acttctggcc gggctacaac 660 
aaggccgccg gcgtgtactg catcggcgag gtgctcgacg tggacccggc ctacacctgc 720 
ccgtaccaga acgtgatgga cggcgtgctc aactacccga tctactaccc gctcctcaac 780 
gccttcaagt ccacctccgg ctcgatggac gacctctaca acatgatcaa caccgtgaag 840 
tccgactgcc cggactccac cctcctcggc accttcgtgg agaaccacga caacccgcgc 900 
ttcgcctcct acaccaacga catcgccctc gccaagaacg tggccgcctt catcatcctc 960 
aacgacggca tcccgatcat ctacgccggc caggagcagc actacgccgg cggcaacgac 1020 
ccggccaacc gcgaggccac ctggctctcc ggctacccga ccgactccga gctgtacaag 1080 
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ctcatcgcct ccgccaacgc catccgcaac tacgccatct ccaaggacac cggcttcgtg 1140 
acctacaaga actggccgat ctacaaggac gacaccacca tcgccatgcg caagggcacc 1200 
gacggctccc agatcgtgac catcctctcc aacaagggcg cctccggcga ctcctacacc 1260 
ctctccctct ccggcgccgg ctacaccgcc ggccagcagc tcaccgaggt gatcggctgc 1320 
accaccgtga ccgtgggctc cgacggcaac gtgccggtgc cgatggccgg cggcctcccg 13 80 
cgcgtgctct acccgaccga gaagctcgcc ggctccaaga tatgctcctc ctccaagccg 1440 
gccaccctcg actcctggct ctccaacgag gccaccgtgg cccgcaccgc catcctcaac 1500 
aacatcggcg ccgacggcgc ctgggtgtcc ggcgccgact ccggcatcgt ggtggcctcc 1560 
ccgtccaccg acaacccgga ctacttctac acctggaccc gcgactccgg catcgtgctc 1620 
aagaccctcg tggacctctt ccgcaacggc gacaccgacc tcctctccac catcgagcac 1680 
tacatctcct cccaggccat catccagggc gtgtccaacc cgtccggcga cctctcctcc 1740 
ggcggcctcg gcgagccgaa gttcaacgtg gacgagaccg cctacgccgg ctcctggggc 1800 
cgcccgcagc gcgacggccc ggccctccgc gccaccgcca tgatcggctt cggccagtgg 1860 
ctcctcgaca acggctacac ctccgccgcc accgagatcg tgtggccgct cgtgcgcaac 1920 
gacctctcct acgtggccca gtactggaac cagaccggct acgacctctg ggaggaggtg 1980 
aacggctcct ccttcttcac catcgccgtg cagcaccgcg ccctcgtgga gggctccgcc 2040 
ttcgccaccg ccgtgggctc ctcctgctcc tggtgcgact cccaggcccc gcagatcctc 2100 
tgctacctcc agtccttctg gaccggctcc tacatcctcg ccaacttcga ctcctcccgc 2160 
tccggcaagg acaccaacac cctcctcggc tccatccaca ccttcgaccc ggaggccggc 2220 
tgcgacgact ccaccttcca gccgtgctcc ccgcgcgccc tcgccaacca caaggaggtg 2280 
gtggactcct tccgctccat ctacaccctc aacgacggcc tctccgactc cgaggccgtg 2340 
gccgtgggcc gctacccgga ggactcctac tacaacggca acccgtggtt cctctgcacc 2400 
ctcgccgccg ccgagcagct ctacgacgcc ctctaccagt gggacaagca gggctccctg 2460 
gagatcaccg acgtgtccct cgacttcttc aaggccctct actccggcgc cgccaccggc 2520 
acctactcct cctcctcctc cacctactcc tccatcgtgt ccgccgtgaa gaccttcgcc 2580 
gacggcttcg tgtccatcgt ggagacccac gccgcctcca acggctccct ctccgagcag 2640 
ttcgacaagt ccgacggcga cgagctgtcc gcccgcgacc tcacctggtc ctacgccgcc 2700 
ctcctcaccg ccaacaaccg ccgcaactcc gtggtgccgc cgtcctgggg cgagacctcc 2760 
gcctcctccg tgccgggcac ctgcgccgcc acctccgcct ccggcaccta ctcctccgtg 2820 
accgtgacct cctggccgtc catcgtggcc accggcggca ccaccaccac cgccaccacc 2880 
accggctccg gcggcgtgac ctccacctcc aagaccacca ccaccgcctc caagacctcc 2940 
accaccacct cctccacctc ctgcaccacc ccgaccgccg tggccgtgac cttcgacctc 3000 
accgccacca ccacctacgg cgagaacatc tacctcgtgg gctccatctc ccagctcggc 3060 
gactgggaga cctccgacgg catcgccctc tccgccgaca agtacacctc ctccaacccg 3120 
ccgtggtacg tgaccgtgac cctcccggcc ggcgagtcct tcgagtacaa gttcatccgc 3180 
gtggagtccg acgactccgt ggagtgggag tccgacccga accgcgagta caccgtgccg 3240 
caggcctgcg gcgagtccac cgccaccgtg accgacacct ggcgc 3285 

<210> 47 
<211> 679 
<212> PRT 

<213> Thermoanaerobacterium thermosaccharolyticum 



<400> 47 
Val Leu Ser Gly 
1 

Phe Asn Asn lie 

20 

Ala Ser Ala Gin 
35 

Arg Val Trp Phe 
50 

Pro Thr lie Asp 
65 



Cys Ser Asn Asn 
5 

Ser Ala Val Asn 

Lys Gin Gly Val 

40 

Thr Leu Ala Asn 
55 

Thr Ala Asp Val 
70 



Val Ser Ser lie 
10 

Gly Pro Gly Glu 
25 

Gly Thr Ala Asn 

Gly Ala lie Ser 

60 

Lys Glu lie Lys 
75 
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Lys lie Asp Arg 
15 

Glu Asp Thr Trp 
30 

Asn Tyr Val Ser 
45 

Glu Val Tyr Tyr 

Phe lie Val Thr 
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Asp 


Gly 


Lys 


Ser 


Phe 


Val 


Ser 


Asp 


Glu 


Thr 


Lys 


Asp 


Ala 


He 


Ser 


Lys 




85 










90 










95 




Val 


Glu 


Lys 


Phe 


Thr 


Asp 


Lys 


Ser 


Leu 


Gly 


Tyr 


Lys 


Leu 


Val 


Asn 


Thr 






100 










105 










110 






Asp 


Lys 


Lys 


Gly 


Arg 


Tyr 


Arg 


He 


Thr 


Lys 


Glu 


He 


Phe 


Thr 


Asp 


Val 






115 










120 










125 








Lys 


Arg 


Asn 


Ser 


Leu 


He 


Met 


Lys 


Ala 


Lys 


Phe 


Glu 


Ala 


Leu 


Glu 


Gly 




130 










135 










140 










Ser 


He 


His 


Asp 


Tyr 


Lys 


Leu 


Tyr 


Leu 


Ala 


Tyr 


Asp 


Pro 


His 


He 


Lys 


145 










150 










155 










160 


Asn 


Gin 


Gly 


Ser 


Tyr 


Asn 


Glu 


Gly 


Tyr 


Val 


He 


Lys 


Ala 


Asn 


Asn 


Asn 










165 










170 










175 




Glu 


Met 


Leu 


Met 


Ala 


Lys 


Arg 


Asp 


Asn 


Val 


Tyr 


Thr 


Ala 


Leu 


Ser 


Ser 








180 










185 










190 






Asn 


He 


Gly 


Trp 


Lys 


Gly 


Tyr 


Ser 


He 


Gly 


Tyr 


Tyr 


Lys 


Val 


Asn 


Asp 






195 










200 










205 








lie 


Met 


Thr 


Asp 


Leu 


Asp 


Glu 


Asn 


Lys 


Gin 


Met 


Thr 


Lys 


His 


Tyr 


Asp 




210 










215 










220 










Ser 


Ala 


Arg 


Gly 


Asn 


He 


He 


Glu 


Gly 


Ala 


Glu 


He 


Asp 


Leu 


Thr 


Lys 


225 










230 










235 










240 


Asn 


Ser 


Glu 


Phe 


Glu 


He 


Val 


Leu 


Ser 


Phe 


Gly 


Gly 


Ser 


Asp 


Ser 


Glu 










245 










250 










255 




Ala 


Ala 


Lys 


Thr 


Ala 


Leu 


Glu 


Thr 


Leu 


Gly 


Glu 


Asp 


Tyr 


Asn 


Asn 


Leu 






260 










265 










270 






Lys 


Asn 


Asn 


Tyr 


He 


Asp 


Glu 


Trp 


Thr 


Lys 


Tyr 


Cys 


Asn 


Thr 


Leu 


Asn 




275 










280 










285 








Asn 


Phe 


Asn 


Gly 


Lys 


Ala 


Asn 


Ser 


Leu 


Tyr 


Tyr 


Asn 


Ser 


Met 


Met 


He 




290 










295 










300 










Leu 


Lys 


Ala 


Ser 


Glu 


Asp 


Lys 


Thr 


Asn 


Lys 


Gly 


Ala 


Tyr 


He 


Ala 


Ser 


305 








310 










315 










320 


Leu 


Ser 


He 


Pro 


Trp 


Gly 


Asp 


Gly 


Gin 


Arg 


Asp 


Asp 


Asn 


Thr 


Gly 


Gly 










325 










330 










335 




Tyr 


His 


Leu 


Val 


Trp 


Ser 


Arg 


Asp 


Leu 


Tyr 


His 


Val 


Ala 


Asn 


Ala 


Phe 






340 










345 










350 






He 


Ala 


Ala 


Gly 


Asp 


Val 


Asp 


Ser 


Ala 


Asn 


Arg 


Ser 


Leu 


Asp 


Tyr 


Leu 






355 










360 










365 








Ala 


Lys 


Val 


Val 


Lys 


Asp 


Asn 


Gly 


Met 


He 


Pro 


Gin 


Asn 


Thr 


Trp 


He 




370 










375 










380 










Ser 


Gly 


Lys 


Pro 


Tyr 


Trp 


Thr 


Ser 


He 


Gin 


Leu 


Asp 


Glu 


Gin 


Ala 


Asp 


385 










390 










395 










400 


Pro 


He 


He 


Leu 


Ser 


Tyr 


Arg 


Leu 


Lys 


Arg 


Tyr 


Asp 


Leu 


Tyr 


Asp 


Ser 










405 










410 










415 




Leu 


Val 


Lys 


Pro 


Leu 


Ala 


Asp 


Phe 


He 


He 


Lys 


He 


Gly 


Pro 


Lys 


Thr 








420 










425 










430 






Gly 


Gin 


Glu 


Arg 


Trp 


Glu 


Glu 


He 


Gly 


Gly 


Tyr 


Ser 


Pro 


Ala 


Thr 


Met 




435 










440 










445 








Ala 


Ala 


Glu 


Val 


Ala 


Gly 


Leu 


Thr 


Cys 


Ala 


Ala 


Tyr 


He 


Ala 


Glu 


Gin 




450 










455 










460 










Asn 


Lys 


Asp 


Tyr 


Glu 


Ser 


Ala 


Gin 


Lys 


Tyr 


Gin 


Glu 


Lys 


Ala 


Asp 


Asn 


465 










470 










475 










480 


Trp 


Gin 


Lys 


Leu 


He 


Asp 


Asn 


Leu 


Thr 


Tyr 


Thr 


Glu 


Asn 


Gly 


Pro 


Leu 








485 










490 










495 




Gly 


Asn 


Gly 


Gin 


Tyr 


Tyr 


He 


Arg 


He 


Ala 


Gly 


Leu 


Ser 


Asp 


Pro 


Asn 






500 










505 










510 







221 
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Ala 


Asp 


Phe 


Met 


He 


Asn 


He 


Ala 


Asn 


Gly 


Gly 


Gly 


Val 


Tyr 


Asp 


Gin 




515 










520 










525 








Lys 


Glu 


He 


Val 


Asp 


Pro 


Ser 


Phe 


Leu 


Glu 


Leu 


Val 


Arg 


Leu 


Gly 


Val 


530 










535 










540 










Lys 


Ser 


Ala 


Asp 


Asp 


Pro 


Lys 


He 


Leu 


Asn 


Thr 


Leu 


Lys 


Val 


Val 


Asp 


545 










550 










555 










560 


Ser 


Thr 


He 


Lys 


val 


Asp 


Thr 


Pro 


Lys 


Gly 


Pro 


Ser 


Trp 


Tyr 


Arg 


Tyr 








565 










570 










575 




Asn 


His 


Asp 


Gly 
580 


Tyr 


Gly 


Glu 


Pro 


Ser 
585 


Lys 


Thr 


Glu 


Leu 


Tyr 
590 


His 


Gly 


Ala 


Gly 


Lys 
595 


Gly 


Arg 


Leu 


Trp 


Pro 
600 


Leu 


Leu 


Thr 


Gly 


Glu 
605 


Arg 


Gly 


Met 


Tyr 


Glu 
610 


He 


Ala 


Ala 


Gly 


Lys 
615 


Asp 


Ala 


Thr 


Pro 


Tyr 
620 


Val 


Lys 


Ala 


Met 


Glu 


Lys 


Phe 


Ala 


Asn 


Glu 


Gly 


Gly 


He 


He 


Ser 


Glu 


Gin 


Val 


Trp 


Glu 


625 








630 










635 










640 


Asp 


Thr 


Gly 


Leu 


Pro 


Thr 


Asp 


Ser 


Ala 


Ser 


Pro 


Leu 


Asn 


Trp 


Ala 


His 






645 










650 










655 




Ala 


Glu 


Tyr 


Val 


He 


Leu 


Phe 


Ala 


Ser 


Asn 


He 


Glu 


His 


Lys 


Val 


Leu 






660 










665 










670 






Asp 


Met 


Pro 


Asp 


He 


Val 


Tyr 





















675 



<210> 48 
<211> 2037 
<212> DNA 

<213> Thermoanaerobacterium thermosaccharolyt icum 
<220> 

<223> synthetic 
<400> 48 

gtgctctccg gctgctccaa caacgtgtcc tccatcaaga tcgaccgctt caacaacatc 60 
tccgccgtga acggcccggg cgaggaggac acctgggcct ccgcccagaa gcagggcgtg 120 
ggcaccgcca acaactacgt gtcccgcgtg tggttcaccc tcgccaacgg cgccatctcc 180 
gaggtgtact acccgaccat cgacaccgcc gacgtgaagg agatcaagtt catcgtgacc 240 
gacggcaagt ccttcgtgtc cgacgagacc aaggacgcca tctccaaggt ggagaagttc 300 
accgacaagt ccctcggcta caagctcgtg aacaccgaca agaagggccg ctaccgcatc 360 
accaaggaaa tcttcaccga cgtgaagcgc aactccctca tcatgaaggc caagttcgag 420 
gccctcgagg gctccatcca cgactacaag ctctacctcg cctacgaccc gcacatcaag 480 
aaccagggct cctacaacga gggctacgtg atcaaggcca acaacaacga gatgctcatg 540 
gccaagcgcg acaacgtgta caccgccctc tcctccaaca tcggctggaa gggctactcc 600 
atcggctact acaaggtgaa cgacatcatg accgacctcg acgagaacaa gcagatgacc 660 
aagcactacg actccgcccg cggcaacatc atcgagggcg ccgagatcga cctcaccaag 720 
aactccgagt tcgagatcgt gctctccttc ggcggctccg actccgaggc cgccaagacc 780 
gccctcgaga ccctcggcga ggactacaac aacctcaaga acaactacat cgacgagtgg 
accaagtact gcaacaccct caacaacttc aacggcaagg ccaactccct ctactacaac 
tccatgatga tcctcaaggc ctccgaggac aagaccaaca agggcgccta catcgcctcc 960 
ctctccatcc cgtggggcga cggccagcgc gacgacaaca ccggcggcta ccacctcgtg 1020 
tggtcccgcg acctctacca cgtggccaac gccttcatcg ccgccggcga cgtggactcc 1080 
gccaaccgct ccctcgacta cctcgccaag gtggtgaagg acaacggcat gatcccgcag 114 0 
aacacctgga tctccggcaa gccgtactgg acctccatcc agctcgacga gcaggccgac 1200 
ccgatcatcc tctcctaccg cctcaagcgc tacgacctct acgactccct cgtgaagccg 1260 
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ctcgccgact tcatcatcaa gatcggcccg aagaccggcc aggagcgctg ggaggagatc 132 0 
ggcggctact ccccggccac gatggccgcc gaggtggccg gcctcacctg cgccgcctac 1380 
atcgccgagc agaacaagga ctacgagtcc gcccagaagt accaggagaa ggccgacaac 1440 
tggcagaagc tcatcgacaa cctcacctac accgagaacg gcccgctcgg caacggccag 1500 
tactacatcc gcatcgccgg cctctccgac ccgaacgccg acttcatgat caacatcgcc 1560 
aacggcggcg gcgtgtacga ccagaaggag atcgtggacc cgtccttcct cgagctggtg 1620 
cgcctcggcg tgaagtccgc cgacgacccg aagatcctca acaccctcaa ggtggtggac 1680 
tccaccatca aggtggacac cccgaagggc ccgtcctggt atcgctacaa ccacgacggc 1740 
tacggcgagc cgtccaagac cgagctgtac cacggcgccg gcaagggccg cctctggccg 1800 
ctcctcaccg gcgagcgcgg catgtacgag atcgccgccg gcaaggacgc caccccgtac 1860 
gtgaaggcga tggagaagtt cgccaacgag ggcggcatca tctccgagca ggtgtgggag 1920 
gacaccggcc tcccgaccga ctccgcctcc ccgctcaact gggcccacgc cgagtacgtg 1980 
atcctcttcg cctccaacat cgagcacaag gtgctcgaca tgccggacat cgtgtac 2037 



<210> 49 
<211> 579 
<212> PRT 

<213> Rhizopus oryzae 



<400> 49 
Ala Ser lie Pro 
1 

Asp Gly Ser Thr 

20 

Ser Lys Lys Val 
35 

Asn Asn Gly Asn 
50 

Ser Asn Tyr Glu 
65 

Glu Phe Tyr lie 

Asn Asn Ser Ala 

100 

Thr Ala Thr Ala 
115 

Pro Pro Ser Arg 
130 

lie Ser Ser Trp 
145 

Leu Arg Asn lie 

Ser Leu Ser Thr 

180 

Ala Ala Leu Thr 
195 

Ser Gly Asn Lys 
210 

Ser Val Lys Thr 
225 

Pro Lys Phe Asn 

Pro Gin Asn Asp 

260 



Ser Ser Ala Ser 

5 

Phe Ser Gly Lys 

Thr Val lie Tyr 

40 

Thr lie Ala Ala 
55 

Tyr Trp Thr Phe 
70 

Lys Tyr Glu Val 
85 

Asn Tyr Gin Val 

Thr Thr Thr Thr 

120 

Ser Glu Pro Ala 
135 

lie Lys Lys Gin 
150 

Asn Pro Pro Gly 
165 

Ala Gly Pro Asp 

Ser Asn Val lie 

200 

Thr lie Leu Asn 
215 

Gin Ser Thr Ser 
230 

Pro Asp Ala Ser 
245 

Gly Pro Ala Glu 



Val Gin Leu Asp 
10 

lie Tyr Val Lys 
25 

Ala Asp Gly Ser 

Ser Tyr Ser Ala 

60 

Ser Ala Ser lie 
75 

Ser Gly Lys Thr 
90 

Ser Thr Ser Lys 
105 

Ala Pro Ser Thr 

Thr Phe Pro Thr 

140 

Glu Gly lie Ser 
155 

Ser Ala Thr Gly 
170 

Tyr Tyr Tyr Ala 
185 

Val Tyr Glu Tyr 

Val Leu Lys Asp 

220 

Thr Val Cys Asn 
235 

Gly Tyr Thr Gly 
250 

Arg Ala Thr Thr 
265 



Ser Tyr Asn Tyr 
15 

Asn lie Ala Tyr 
30 

Asp Asn Trp Asn 
45 

Pro lie Ser Gly 

Asn Gly lie Lys 

80 

Tyr Tyr Asp Asn 
95 

Pro Thr Thr Thr 
110 

Ser Thr Thr Thr 
125 

Gly Asn Ser Thr 

Arg Phe Ala Met 

160 

Phe lie Ala Ala 
175 

Trp Thr Arg Asp 
190 

Asn Thr Thr Leu 
205 

Tyr Val Thr Phe 

Cys Leu Gly Glu 

240 

Ala Trp Gly Arg 
255 

Phe lie Leu Phe 
270 
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Aj.a 


Asp 


Ser 


Tyr 






275 




Thr 


Leu 


Lys 


Pro 




290 






Trp 


Ser 


Asn 


Gly 


305 








Phe 


Tyr 


Thr 


Leu 


Phe 


Ala 


Lys 


Arg 








340 


1 fllT 


Ala 


ser 


i nr 






355 




Asn 


Asn 


Trp 


He 




370 






Lys 


Gly 


Leu 


Asp 


385 








Asp 


Asp 


Gly 


Phe 


Val 


Ala 


Val 


Glu 








420 


Leu 


Pro 


Ser 


Tyr 






435 




Tyr 


Asn 


Gly 


Asn 




450 






Thr 


Gly 


Tyr 


Ala 


465 








Asn 


Gly 


Gly 


Val 


Phe 


Asp 


Ser 


Ser 








500 


Asp 


Pne 


Asn 


Asn 






515 




Leu 


Ser 


Thr 


Val 




530 






Glu 


Phe 


Asp 


Arg 


545 








Trp 


Ser 


His 


Ala 


Pro 


Ala 


Ala 





Leu Thr Gin Thr 

280 

Ala He Phe Lys 
295 

Cys Phe Asp Leu 
310 

Met Val Met Arg 
325 

Asn Gly Asp Ser 

He Ala Asn Lys 

360 

Gin Val Ser Gin 
375 

Val Ser Thr Leu 
390 

Phe Thr Pro Gly 
405 

Asp Ser Phe Ala 

Leu Gly Asn Ser 

440 

Gly Asn Ser Gin 
455 

Glu Leu Tyr Tyr 
470 

Thr Val Ser Ser 
485 

Ala Thr Ser Gly 

Leu Ala Gin Asn 

520 

Gin Leu His Ala 
535 

Thr Thr Gly Leu 
550 

Ser Leu He Thr 
565 



Lys Asp Ala Ser 

Asp Leu Asp Tyr 

300 

Trp Glu Glu Val 
315 

Lys Gly Leu Leu 
330 

Thr Arg Ala Ser 
345 

He Ser Ser Phe 

Ser Val Thr Gly 

380 

Leu Ala Ala Asn 
395 

Ser Glu Lys He 
410 

Ser Leu Tyr Pro 
425 

lie Gly Arg Tyr 

Gly Asn Ser Trp 

460 

Arg Ala He Lys 
475 

He Ser Leu Pro 
490 

Lys Lys Tyr Thr 
505 

He Ala Leu Ala 

His Asn Asn Gly 

540 

Ser Thr Gly Ala 
555 

Ala Ser Tyr Ala 
570 



Tyr Val Thr Gly 
285 

Val Val Asn Val 

Asn Gly Val His 

320 

Leu Gly Ala Asp 
335 

Thr Tyr Ser Ser 
350 

Trp Val Ser Ser 
365 

Gly Val Ser Lys 

Leu Gly Ser Val 

400 

Leu Ala Thr Ala 
415 

lie Asn Lys Asn 
430 

Pro Glu Asp Thr 
445 

Phe Leu Ala Val 

Glu Trp lie Gly 

480 

Phe Phe Lys Lys 
495 

Val Gly Thr Ser 
510 

Ala Asp Arg Phe 
525 

Ser Leu Ala Glu 

Arg Asp Leu Thr 

560 

Lys Ala Gly Ala 
575 



<210> 50 

<211> 1737 

<212> DNA 

<213> Rhizopus oryzae 



<400> 50 

gcctccatcc 

ttctccggca 

gccgacggct 

ccgatctccg 

gagt tctaca 

aactaccagg 



cgtcctccgc 
aaatctacgt 
ccgacaactg 
gctccaacta 
tcaagtacga 
tgtccacctc 



ctccgtgcag 
gaagaacatc 
gaacaacaac 
cgagtactgg 
ggtgtccggc 
caagccgacc 



ctcgactcct 
gcctactcca 
ggcaacacca 
accttctccg 
aagacctact 
accaccaccg 

224 



acaactacga 
agaaggtgac 
tcgccgcctc 
cctccatcaa 
acgacaacaa 
ccaccgccac 



cggctccacc 60 
cgtgatctac 120 
ctactccgcc 180 
cggcatcaag 240 
caactccgcc 300 
caccaccacc 360 
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gccccgtcca cctccaccac caccccgccg tcccgctccg agccggccac cttcccgacc 420 

ggcaactcca ccatctcctc ctggatcaag aagcaggagg gcatctcccg cttcgccatg 480 

ctccgcaaca tcaacccgcc gggctccgcc accggcttca tcgccgcctc cctctccacc 540 

gccggcccgg actactacta cgcctggacc cgcgacgccg ccctcacctc caacgtgatc 600 

gtgtacgagt acaacaccac cctctccggc aacaagacca tcctcaacgt gctcaaggac 660 

tacgtgacct tctccgtgaa gacccagtcc acctccaccg tgtgcaactg cctcggcgag 720 

ccgaagttca acccggacgc ctccggctac accggcgcct ggggccgccc gcagaacgac 7 80 

ggcccggccg agcgcgccac caccttcatc ctcttcgccg actcctacct cacccagacc 840 

aaggacgcct cctacgtgac cggcaccctc aagccggcca tcttcaagga cctcgactac 900 

gtggtgaacg tgtggtccaa cggctgcttc gacctctggg aggaggtgaa cggcgtgcac 960 

ttctacaccc tcatggtgat gcgcaagggc ctcctcctcg gcgccgactt cgccaagcgc 1020 

aacggcgact ccacccgcgc ctccacctac tcctccaccg cctccaccat cgccaacaaa 1080 

atctcctcct tctgggtgtc ctccaacaac tggatacagg tgtcccagtc cgtgaccggc 1140 

ggcgtgtcca agaagggcct cgacgtgtcc accctcctcg ccgccaacct cggctccgtg 1200 

gacgacggct tcttcacccc gggctccgag aagatcctcg ccaccgccgt ggccgtggag 1260 

gactccttcg cctccctcta cccgatcaac aagaacctcc cgtcctacct cggcaactcc 1320 

atcggccgct acccggagga cacctacaac ggcaacggca actcccaggg caactcctgg 1380 

ttcctcgccg tgaccggcta cgccgagctg tactaccgcg ccatcaagga gtggatcggc 1440 

aacggcggcg tgaccgtgtc ctccatctcc ctcccgttct tcaagaagtt cgactcctcc 1500 

gccacctccg gcaagaagta caccgtgggc acctccgact tcaacaacct cgcccagaac 1560 

atcgccctcg ccgccgaccg cttcctctcc accgtgcagc tccacgccca caacaacggc 1620 

tccctcgccg aggagttcga ccgcaccacc ggcctctcca ccggcgcccg cgacctcacc 1680 

tggtcccacg cctccctcat caccgcctcc tacgccaagg ccggcgcccc ggccgcc 1737 

<210> 51 
<211> 439 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 51 

Met Ala Lys His Leu Ala Ala Met Cys Trp Cys Ser Leu Leu Val Leu 

15 10 15 

Val Leu Leu Cys Leu Gly Ser Gin Leu Ala Gin Ser Gin Val Leu Phe 

20 25 30 

Gin Gly Phe Asn Trp Glu Ser Trp Lys Lys Gin Gly Gly Trp Tyr Asn 

35 40 45 

Tyr Leu Leu Gly Arg Val Asp Asp lie Ala Ala Thr Gly Ala Thr His 

50 55 60 

Val Trp Leu Pro Gin Pro Ser His Ser Val Ala Pro Gin Gly Tyr Met 
65 70 75 80 

Pro Gly Arg Leu Tyr Asp Leu Asp Ala Ser Lys Tyr Gly Thr His Ala 

85 90 95 

Glu Leu Lys Ser Leu Thr Ala Ala Phe His Ala Lys Gly Val Gin Cys 

100 105 110 

Val Ala Asp Val Val He Asn His Arg Cys Ala Asp Tyr Lys Asp Gly 

115 120 125 

Arg Gly He Tyr Cys Val Phe Glu Gly Gly Thr Pro Asp Ser Arg Leu 

130 135 140 

Asp Trp Gly Pro Asp Met He Cys Ser Asp Asp Thr Gin Tyr Ser Asn 
145 150 155 160 

Gly Arg Gly His Arg Asp Thr Gly Ala Asp Phe Ala Ala Ala Pro Asp 

225 
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165 



lie 


Asp 


His 


Leu 


Asn 


Pro 


Arg 


Val 








180 










Asn 


Trp 


Leu 


Lys 


Ser 


Asp 


Leu 


Gly 






195 










200 


Ala 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ala 


Val 




210 










215 




Ala 


Pro 


Thr 


Phe 


Val 


Val 


Ala 


Glu 


225 










230 






Gly 


Asn 


Gly 


Glu 


Pro 


Ser 


Ser 


Asn 










245 








Val 


Asn 


Trp 


Ala 


Gin 


Ala 


Val 


Gly 








260 










Thr 


Thr 


Lys 


Gly 


Val 


Leu 


Gin 


Ala 






275 










280 


Met 


Lys 


Asp 


Gly 


Asn 


Gly 


Lys 


Ala 




290 










295 




Glu 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


Asp 


305 










310 






Asn 


Ser 


Trp 


Pro 


Phe 


Pro 


Ser 


Asp 










325 








He 


Leu 


Thr 


His 


Pro 


Gly 


Thr 


Pro 








340 










Asp 


Trp 


Asn 


Leu 


Lys 


Gin 


Glu 


He 






355 










360 


Arg 


Asn 


Gly 


He 


His 


Pro 


Gly 


Ser 




370 










375 




Gly 


Asp 


Leu 


Tyr 


Val 


Ala 


Lys 


He 


385 










390 






Gly 


Ser 


Arg 


Tyr 


Asp 


Val 


Gly 


Asn 










405 








Val 


Ala 


His 


Gly 


Asn 


Asn 


Tyr 


Cys 








420 










Val 


Pro 


Ala 


Gly 


Arg 


His 


His 







170 










175 




Gin 


Gin 


Glu 


Leu 


Ser 


Asp 


Trp 


Leu 


185 










190 






Phe 


Asp 


Gly 


Trp 


Arg 


Leu 


Asp 


Phe 










205 








Ala 


Lys 


Val 


Tyr 


Val 


Asp 


Ser 


Thr 








220 










He 


Trp 


Ser 


Ser 


Leu 


His 


Tyr 


Asp 






235 










240 


Gin 


Asp 


Ala 


Asp 


Arg 


Gin 


Glu 


Leu 




250 










255 




Gly 


Pro 


Ala 


Ala 


Ala 


Phe 


Asp 


Phe 


265 










270 






Ala 


Val 


Gin 


Gly 


Glu 


Leu 


Trp 


Arg 










285 








Pro 


Gly 


Met 


He 


Gly 


Trp 


Leu 


Pro 








300 










Asn 


His 


Asp 


Thr 


Gly 


Ser 


Thr 


Gin 






315 










320 


Lys 


Val 


Met 


Gin 


Gly 


Tyr 


Ala 


Tyr 




330 










335 




Cys 


He 


Phe 


Tyr 


Asp 


His 


Val 


Phe 


345 










350 






Ser 


Ala 


Leu 


Ser 


Ala 


Val 


Arg 


Ser 










365 








Glu 


Leu 


Asn 


He 


Leu 


Ala 


Ala 


Asp 








380 










Asp 


Asp 


Lys 


Val 


He 


Val 


Lys 


He 






395 










400 


Leu 


lie 


Pro 


Ser 


Asp 


Phe 


His 


Ala 




410 










415 




Val 


Trp 


Glu 


Lys 


His 


Gly 


Leu 


Arg 


425 










430 







<210> 52 
<211> 1320 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 52 

atggcgaagc acttggctgc catgtgctgg 
ttgggctccc agctggccca atcccaggtc 
aagaagcaag gtgggtggta caactacctc 
ggggccacgc acgtctggct cccgcagccg 
cccggccggc tctacgacct ggacgcgtcc 
ctcaccgcgg cgttccacgc caagggcgtc 
cgctgcgccg actacaagga cggccgcggc 
gacagccgcc tcgactgggg ccccgacatg 



tgcagcctcc tagtgcttgt actgctctgc 60 
ctcttccagg ggttcaactg ggagtcgtgg 120 
ctggggcggg tggacgacat cgccgcgacg 180 
tcgcactcgg tggcgccgca ggggtacatg 24 0 
aagtacggca cccacgcgga gctcaagtcg 300 
cagtgcgtcg ccgacgtcgt gatcaaccac 360 
atctactgcg tcttcgaggg cggcacgccc 420 
atctgcagcg acgacacgca gtactccaac 480 
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gggcgcgggc accgcgacac gggggccgac ttcgccgccg cgcccgacat cgaccacctc 540 
aacccgcgcg tgcagcagga gctctcggac tggctcaact ggctcaagtc cgacctcggc 600 
ttcgacggct ggcgcctcga cttcgccaag ggctactccg ccgccgtcgc caaggtgtac 660 
gtcgacagca ccgcccccac cttcgtcgtc gccgagatat ggagctccct ccactacgac 720 
ggcaacggcg agccgtccag caaccaggac gccgacaggc aggagctggt caactgggcg 780 
caggcggtgg gcggccccgc cgcggcgttc gacttcacca ccaagggcgt gctgcaggcg 840 
gccgtccagg gcgagctgtg gcgcatgaag gacggcaacg gcaaggcgcc cgggatgatc 900 
ggctggctgc cggagaaggc cgtcacgttc gtcgacaacc acgacaccgg ctccacgcag 960 
aactcgtggc cattcccctc cgacaaggtc atgcagggct acgcctatat cctcacgcac 1020 
ccaggaactc catgcatctt ctacgaccac gttttcgact ggaacctgaa gcaggagatc 1080 
agcgcgctgt ctgcggtgag gtcaagaaac gggatccacc cggggagcga gctgaacatc 1140 
ctcgccgccg acggggatct ctacgtcgcc aagattgacg acaaggtcat cgtgaagatc 1200 
gggtcacggt acgacgtcgg gaacctgatc ccctcagact tccacgccgt tgcccctggc 1260 
aacaactact gcgtttggga gaagcacggt ctgagagttc cagcggggcg gcaccactag 1320 



<210> 53 
<211> 45 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

Thr Gly Ser Gly Gly 

15 

Ser Lys Thr Ser Thr 
30 

Ala Val 
45 



<400> 53 

Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr 

15 10 
Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala 

20 25 
Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr 
35 40 



<210> 54 
<211> 137 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 54 

gccaccggcg gcaccaccac caccgccacc accaccggct ccggcggcgt gacctccacc 60 
tccaagacca ccaccaccgc ctccaagacc tccaccacca cctcctccac ctcctgcacc 120 
accccgaccg ccgtgtc 137 

<210> 55 
<211> 300 
<212> PRT 

<213> Pyrococcus furiosus 
<400> 55 

He Tyr Phe Val Glu Lys Tyr His Thr Ser Glu Asp Lys Ser Thr Ser 

! 5 10 15 

Asn Thr Ser Ser Thr Pro Pro Gin Thr Thr Leu Ser Thr Thr Lys Val 
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20 25 30 



Leu 


Lys 


He 
35 


Arg 


Tyr 


Pro 


Asp 


Asp 
40 


Gly 


Glu 


Trp 


Pro 


Gly 
45 


Ala 


Pro 


lie 


Asp 


Lys 
50 


Asp 


Gly 


Asp 


Gly 


Asn 
55 


Pro 


Glu 


Phe 


Tyr 


He 
60 


Glu 


He 


Asn 


Leu 


Trp 


Asn 


He 


Leu 


Asn 


Ala 


Thr 


Gly 


Phe 


Ala 


Glu 


Met 


Thr 


Tyr 


Asn 


Leu 


65 










70 










75 










80 


Thr 


Ser 


Gly 


Val 


Leu 


His 


Tyr 


Val 


Gin 


Gin 


Leu 


Asp 


Asn 


lie 


Val 


Leu 








85 










90 










95 




Arg 


Asp 


Arg 


Ser 
100 


Asn 


Trp 


Val 


His 


Gly 
105 


Tyr 


Pro 


Glu 


He 


Phe 
110 


Tyr 


Gly 


Asn 


Lys 


Pro 
115 


Trp 


Asn 


Ala 


Asn 


Tyr 
120 


Ala 


Thr 


Asp 


Gly 


Pro 
125 


He 


Pro 


Leu 


Pro 


Ser 
130 


Lys 


Val 


Ser 


Asn 


Leu 
135 


Thr 


Asp 


Phe 


Tyr 


Leu 
140 


Thr 


He 


Ser 


Tyr 


Lys 


Leu 


Glu 


Pro 


Lys 


Asn 


Gly 


Leu 


Pro 


lie 


Asn 


Phe 


Ala 


He 


Glu 


Ser 


145 










150 










155 










160 


Trp 


Leu 


Thr 


Arg 


Glu 


Ala 


Trp 


Arg 


Thr 


Thr 


Gly 


He 


Asn 


Ser 


Asp 


Glu 








165 










170 










175 




Gin 


Glu 


Val 


Met 
180 


He 


Trp 


lie 


Tyr 


Tyr 
185 


Asp 


Gly 


Leu 


Gin 


Pro 
190 


Ala 


Gly 


Ser 


Lys 


Val 


Lys 


Glu 


He 


Val 


Val 


Pro 


He 


lie 


Val 


Asn 


Gly 


Thr 


Pro 




195 










200 










205 








Val 


Asn 
210 


Ala 


Thr 


Phe 


Glu 


Val 
215 


Trp 


Lys 


Ala 


Asn 


He 

220 


Gly 


Trp 


Glu 


Tyr 


Val 


Ala 


Phe 


Arg 


He 


Lys 


Thr 


Pro 


He 


Lys 


Glu 


Gly 


Thr 


Val 


Thr 


lie 


225 










230 










235 










240 


Pro 


Tyr 


Gly 


Ala 


Phe 


He 


Ser 


Val 


Ala 


Ala 


Asn 


He 


Ser 


Ser 


Leu 


Pro 






245 










250 










255 




Asn 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Leu 


Glu 


Asp 


Val 


Glu 


He 


Gly 


Thr 


Glu 


Phe 






260 










265 










270 






Gly 


Thr 


Pro 


Ser 


Thr 


Thr 


Ser 


Ala 


His 


Leu 


Glu 


Trp 


Trp 


He 


Thr 


Asn 




275 










280 










285 








lie 


Thr 
290 


Leu 


Thr 


Pro 


Leu 


Asp 
295 


Arg 


Pro 


Leu 


He 


Ser 
300 











<210> 56 
<211> 903 
<212> DNA 

<213> Pyrococcus furiosus 
<400> 56 

atctacttcg tggagaagta ccacacctcc gaggacaagt ccacctccaa cacctcctcc 60 
accccgccgc agaccaccct ctccaccacc aaggtgctca agatccgcta cccggacgac 120 
ggcgagtggc ccggcgcccc gatcgacaag gacggcgacg gcaacccgga gttctacatc 180 
gagatcaacc tctggaacat cctcaacgcc accggcttcg ccgagatgac ctacaacctc 240 
actagtggcg tgctccacta cgtgcagcag ctcgacaaca tcgtgctccg cgaccgctcc 300 
aactgggtgc acggctaccc ggaaatcttc tacggcaaca agccgtggaa cgccaactac 360 
gccaccgacg gcccgatccc gctcccgtcc aaggtgtcca acctcaccga cttctacctc 420 
accatctcct acaagctcga gccgaagaac ggtctcccga tcaacttcgc catcgagtcc 480 
tggctcaccc gcgaggcctg gcgcaccacc ggcatcaact ccgacgagca ggaggtgatg 540 
atctggatct actacgacgg cctccagccc gcgggctcca aggtgaagga gatcgtggtg 600 
ccgatcatcg tgaacggcac cccggtgaac gccaccttcg aggtgtggaa ggccaacatc 660 
ggctgggagt acgtggcctt ccgcatcaag accccgatca aggagggcac cgtgaccatc 720 
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ccgtacggcg ccttcatctc 
aagtacctcg aggacgtgga 
cacctcgagt ggtggatcac 
tag 



cgtggccgcc aacatctcct 
gatcggcacc gagttcggca 
caacatcacc ctcaccccgc 



ccctcccgaa ctacaccgag 780 
ccccgtccac cacctccgcc 840 
tcgaccgccc gctcatctcc 900 

903 



<210> 57 
<211> 387 
<212> PRT 

<213> Thermus flavus 



<400> 57 



Met 


Tyr 


Glu 


Pro 


Lys 


Pro 


Glu 


His 


Arg 


Phe 


Thr 


Phe 


Gly 


Leu 


Trp 


Thr 


1 






5 










10 










15 




Val 


Asp 


Asn 


Val 


Asp 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Thr 


Val 


Arg 


Glu 


Arg 








20 










25 










30 






Leu 


Asp 


Pro 


Val 


Tyr 


Val 


Val 


His 


Lys 


Leu 


Ala 


Glu 


Leu 


Gly 


Ala 


Tyr 






35 










40 










45 








Gly 


Val 


Asn 


Leu 


His 


Asp 


Glu 


Asp 


Leu 


He 


Pro 


Arg 


Gly 


Thr 


Pro 


Pro 




50 










55 










60 










Gin 


Glu 


Arg 


Asp 


Gin 


lie 


Val 


Arg 


Arg 


Phe 


Lys 


Lys 


Ala 


Leu 


Asp 


Glu 


65 










70 










75 










80 


Thr 


Val 


Leu 


Lys 


Val 


Pro 


Met 


Val 


Thr 


Ala 


Asn 


Leu 


Phe 


Ser 


Glu 


Pro 








85 










90 










95 




Ala 


Phe 


Arg 


Asp 


Gly 


Ala 


Ser 


Thr 


Thr 


Arg 


Asp 


Pro 


Trp 


Val 


Trp 


Ala 








100 










105 










110 






Tyr 


Ala 


Leu 


Arg 


Lys 


Ser 


Leu 


Glu 


Thr 


Met 


Asp 


Leu 


Gly 


Ala 


Glu 


Leu 






115 










120 










125 








Gly 


Ala 


Glu 


lie 


Tyr 


Met 


Phe 


Trp 


Met 


Val 


Arg 


Glu 


Arg 


Ser 


Glu 


Val 


130 










135 










140 










Glu 


Ser 


Thr 


Asp 


Lys 


Thr 


Arg 


Lys 


Val 


Trp 


Asp 


Trp 


Val 


Arg 


Glu 


Thr 


145 










150 










155 










160 


Leu 


Asn 


Phe 


Met 


Thr 


Ala 


Tyr 


Thr 


Glu 


Asp 


Gin 


Gly 


Tyr 


Gly 


Tyr 


Arg 










165 










170 










175 




Phe 


Ser 


Val 


Glu 


Pro 


Lys 


Pro 


Asn 


Glu 


Pro 


Arg 


Gly 


Asp 


He 


Tyr 


Phe 








180 










185 










190 






Thr 


Thr 


Val 


Gly 


Ser 


Met 


Leu 


Ala 


Leu 


He 


His 


Thr 


Leu 


Asp 


Arg 


Pro 






195 










200 










205 








Glu 


Arg 


Phe 


Gly 


Leu 


Asn 


Pro 


Glu 


Phe 


Ala 


His 


Glu 


Thr 


Met 


Ala 


Gly 




210 










215 










220 










Leu 


Asn 


Phe 


Asp 


His 


Ala 


Val 


Ala 


Gin 


Ala 


Val 


Asp 


Ala 


Gly 


Lys 


Leu 


225 










230 










235 










240 


Phe 


His 


lie 


Asp 


Leu 


Asn 


Asp 


Gin 


Arg 


Met 


Ser 


Arg 


Phe 


Asp 


Gin 


Asp 










245 










250 










255 




Leu 


Arg 


Phe 


Gly 


Ser 


Glu 


Asn 


Leu 


Lys 


Ala 


Gly 


Phe 


Phe 


Leu 


Val 


Asp 








260 










265 










270 






Leu 


Leu 


Glu 


Ser 


Ser 


Gly 


Tyr 


Gin 


Gly 


Pro 


Arg 


His 


Phe 


Glu 


Ala 


His 






275 










280 










285 








Ala 


Leu 


Arg 


Thr 


Glu 


Asp 


Glu 


Glu 


Gly 


Val 


Trp 


Thr 


Phe 


Val 


Arg 


Val 




290 










295 










300 










Cys 


Met 


Arg 


Thr 


Tyr 


Leu 


lie 


He 


Lys 


Val 


Arg 


Ala 


Glu 


Thr 


Phe 


Arg 


305 










310 










315 










320 


Glu 


Asp 


Pro 


Glu 


Val 


Lys 


Glu 


Leu 


Leu 


Ala 


Ala 


Tyr 


Tyr 


Gin 


Glu 


Asp 








325 










330 










335 




Pro 


Ala 


Thr 


Leu 


Ala 


Leu 


Leu 


Asp 


Pro 


Tyr 


Ser 


Arg 


Glu 


Lys 


Ala 


Glu 
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340 

Ala Leu Lys Arg Ala Glu Leu Pro 
355 360 
Tyr Ala Leu Glu Arg Leu Asp Gin 

370 375 
Val Arg Gly 
385 



345 350 
Leu Glu Thr Lys Arg Arg Arg Gly 

365 

Leu Ala Val Glu Tyr Leu Leu Gly 

380 



<210> 58 
<211> 978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 58 

atggggaaga acggcaacct gtgctgcttc tctctgctgc tgcttcttct cgccgggttg 60 
gcgtccggcc atcaaatcta cttcgtggag aagtaccaca cctccgagga caagtccacc 120 
tccaacacct cctccacccc gccgcagacc accctctcca ccaccaaggt gctcaagatc 180 
cgctacccgg acgacggtga gtggcccggc gccccgatcg acaaggacgg cgacggcaac 24 0 
ccggagttct acatcgagat caacctctgg aacatcctca acgccaccgg cttcgccgag 300 
atgacctaca acctcactag tggcgtgctc cactacgtgc agcagctcga caacatcgtg 360 
ctccgcgacc gctccaactg ggtgcacggc tacccggaaa tcttctacgg caacaagccg 420 
tggaacgcca actacgccac cgacggcccg atcccgctcc cgtccaaggt gtccaacctc 480 
accgacttct acctcaccat ctcctacaag ctcgagccga agaacggtct cccgatcaac 540 
ttcgccatcg agtcctggct cacccgcgag gcctggcgca ccaccggcat caactccgac 600 
gagcaggagg tgatgatctg gatctactac gacggcctcc agcccgcggg ctccaaggtg 660 
aaggagatcg tggtgccgat catcgtgaac ggcaccccgg tgaacgccac cttcgaggtg 720 
tggaaggcca acatcggctg ggagtacgtg gccttccgca tcaagacccc gatcaaggag 780 
ggcaccgtga ccatcccgta cggcgccttc atctccgtgg ccgccaacat ctcctccctc 840 
ccgaactaca ccgagaagta cctcgaggac gtggagatcg gcaccgagtt cggcaccccg 900 
tccaccacct ccgcccacct cgagtggtgg atcaccaaca tcaccctcac cccgctcgac 960 
cgcccgctca tctcctag 978 

<210> 59 
<211> 1920 
<2 12> DNA 

<213> Aspergillus niger 
<400> 59 

atgtccttcc gctccctcct cgccctctcc ggcctcgtgt gcaccggcct cgccaacgtg 60 
atctccaagc gcgccaccct cgactcctgg ctctccaacg aggccaccgt ggcccgcacc 120 
gccatcctca acaacatcgg cgccgacggc gcctgggtgt ccggcgccga ctccggcatc 180 
gtggtggcct ccccgtccac cgacaacccg gactacttct acacctggac ccgcgactcc 240 
ggcctcgtgc tcaagaccct cgtggacctc ttccgcaacg gcgacacctc cctcctctcc 300 
accatcgaga actacatctc cgcccaggcc atcgtgcagg gcatctccaa cccgtccggc 360 
gacctctcct ccggcgccgg cctcggcgag ccgaagttca acgtggacga gaccgcctac 420 
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accggctcct ggggccgccc 
ggcttcggcc agtggctcct 
ccgctcgtgc gcaacgacct 
ctctgggagg aggtgaacgg 
5 gtggagggct ccgccttcgc 
gccccggaga tcctctgcta 
ttcgactcct cccgctccgg 
gacccggagg ccgcctgcga 
aaccacaagg aggtggtgga 

10 gactccgagg ccgtggccgt 
tggttcctct gcaccctcgc 
aagcagggct ccctcgaggt 
gacgccgcca ccggcaccta 
gtgaagacct tcgccgacgg 

15 tccatgtccg agcagtacga 
tggtcctacg ccgccctcct 
tggggcgaga cctccgcctc 
acctactcct ccgtgaccgt 
accaccgcca ccccgaccgg 

20 gcctccaaga cctccacctc 
gtgaccttcg acctcaccgc 
atctcccagc tcggcgactg 
acctcctccg acccgctctg 
tacaagttca tccgcatcga 

25 gagtacaccg tgccgcaggc 



gcagcgcgac ggcccggccc 
cgacaacggc tacacctcca 
ctcctacgtg gcccagtact 
ctcctccttc ttcaccatcg 
caccgccgtg ggctcctcct 
cctccagtcc ttctggaccg 
caaggacgcc aacaccctcc 
cgactccacc ttccagccgt 
ctccttccgc tccatctaca 
gggccgctac ccggaggaca 
cgccgccgag cagctctacg 
gaccgacgtg tccctcgact 
ctcctcctcc tcctccacct 
cttcgtgtcc atcgtggaga 
caagtccgac ggcgagcagc 
caccgccaac aaccgccgca 
ctccgtgccg ggcacctgcg 
gacctcctgg ccgtccatcg 
ctccggctcc gtgacctcca 
cacctcctcc acctcctgca 
caccaccacc tacggcgaga 
ggagacctcc gacggcatcg 
gtacgtgacc gtgaccctcc 
gtccgacgac tccgtggagt 
ctgcggcacc tccaccgcca 



tccgcgccac cgccatgatc 4 80 
ccgccaccga catcgtgtgg 540 
ggaaccagac cggctacgac 600 
ccgtgcagca ccgcgccctc 660 
gctcctggtg cgactcccag 720 
gctccttcat cctcgccaac 780 
tcggctccat ccacaccttc 840 
gctccccgcg cgccctcgcc 900 
ccctcaacga cggcctctcc 960 
cctactacaa cggcaacccg 1020 
acgccctcta ccagtgggac 1080 
tcttcaaggc cctctactcc 1140 
actcctccat cgtggacgcc 1200 
cccacgccgc ctccaacggc 1260 
tctccgcccg cgacctcacc 1320 
actccgtggt gccggcctcc 1380 
ccgccacctc cgccatcggc 1440 
tggccaccgg cggcaccacc 1500 
cctccaagac caccgccacc 1560 
ccaccccgac cgccgtggcc 1620 
acatctacct cgtgggctcc 1680 
ccctctccgc cgacaagtac 1740 
cggccggcga gtccttcgag 1800 
gggagtccga cccgaaccgc 1860 
ccgtgaccga cacctggcgc 1920 



<210> 60 

<211> 6 

30 <212> PRT 

<213> Artificial Sequence 



35 



<220> 

<223> synthetic 
<400> 60 

Ser Glu Lys Asp Glu Leu 



40 

<210> 61 

<211> 561 

<212> DNA 

45 <213> Artificial Sequence 

<220> 

<223> Xylanase BD7436 

50 

<220> 

<221> CDS 

<222> (1) . . (561) 

55 <400> 61 

atg get age acc ttc tac tgg cat ttg tgg acc gac ggc ate ggc acc 48 
Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly lie Gly Thr 
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1 

gtg aac get 
Val Asn Ala 



aac tgc ggc 
Asn Cys Gly 
35 

acc agg gtg 
Thr Arg Val 
50 

get tac ttg 
Ala Tyr Leu 
65 

tac gtg gtg 
Tyr Val Val 



ggc acc gtg 
Gly Thr Val 



agg acc aac 
Arg Thr Asn 
115 

ttc tgg age 
Phe Trp Ser 
130 

ate acc ttc 
lie Thr Phe 
145 

ttg ggc age 
Leu Gly Ser 



age age ggc 
Ser Ser Gly 



5 

acc aac ggc 
Thr Asn Gly 
20 

aac ttc gtg 
Asn Phe Val 



ate aac tac 
lie Asn Tyr 



get ttg tac 
Ala Leu Tyr 

70 

gac age tgg 
Asp Ser Trp 
85 

acc age gac 
Thr Ser Asp 
100 

get cca age 
Ala Pro Ser 



gtg agg caa 
Val Arg Gin 



age aac cat 
Ser Asn His 
150 

age tgg age 
Ser Trp Ser 
165 

tac age aac 
Tyr Ser Asn 
180 



age gac ggc 
Ser Asp Gly 
25 

gtg ggc aag 
Val Gly Lys 
40 

aac get cat 
Asn Ala His 
55 

ggc tgg acc 
Gly Trp Thr 



ggc acc tac 
Gly Thr Tyr 



ggc ggc acc 
Gly Gly Thr 
105 

ate gac ggc 
lie Asp Gly 
120 

age aag agg 
Ser Lys Arg 
135 

gtg aac get 
Val Asn Ala 



tac caa gtg 
Tyr Gin Val 



gtg acc gtg 
Val Thr Val 
185 



10 

aac tac age 
Asn Tyr Ser 



ggc tgg acc 
Gly Trp Thr 



get ttc age 
Ala Phe Ser 
60 

agg aac age 
Arg Asn Ser 
75 

agg cca acc 
Arg Pro Thr 
90 

tac gac ate 
Tyr Asp lie 



aac aac acc 
Asn Asn Thr 



cca ate ggc 
Pro lie Gly 
140 

tgg aag age 
Trp Lys Ser 
155 

ttg get acc 
Leu Ala Thr 
170 

tgg tag 
Trp 



15 

gtg age tgg 
Val Ser Trp 
30 

acc ggc age 
Thr Gly Ser 
45 

gtg gtg ggc 
Val val Gly 



ttg ate gag 
Leu lie Glu 



ggc acc tac 
Gly Thr Tyr 
95 

tac acc acc 
Tyr Thr Thr 
110 

acc ttc acc 
Thr Phe Thr 
125 

acc aac aac 
Thr Asn Asn 



aag ggc atg 
Lys Gly Met 



gag ggc tac 
Glu Gly Tyr 
175 



age 96 
Ser 



get 144 
Ala 



aac 192 
Asn 



tac 240 

Tyr 

80 

aag 288 
Lys 

acc 336 
Thr 



caa 384 
Gin 



acc 432 
Thr 



aac 480 

Asn 

160 

caa 528 
Gin 



561 



<210> 62 

<211> 186 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 62 
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Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly He Gly Thr 
15 10 15 

5 Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
10 35 40 45 

Thr Arg Val He Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

15 

Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 

20 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

25 Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 HO 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
30 115 120 125 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

35 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

40 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



45 Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 63 

50 <211> 561 

<212> DNA 

<213> Artificial Sequence 



<220> 

55 <223> Xylanase BD6002A 
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<220> 

<221> CDS 

<222> (1) . . (561) 

5 <400> 63 

atg get age acc gac tac tgg caa aac tgg acc gac ggc ggc ggc acc 48 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

10 gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 
Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 
15 Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 

35 40 45 

acc agg gtg ate aac tac aac get ggc get ttc age cca age ggc aac 192 
Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
20 50 55 60 

ggc tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 240 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

25 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 288 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

30 ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 110 

agg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 384 
35 Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 

115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 432 
Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
40 130 135 140 

ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 150 155 160 

45 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 52 8 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

50 age age ggc tac age aac gtg acc gtg tgg tag 561 
Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



55 <210> 64 
<211> 186 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

5 

<400> 64 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
1 5 10 15 

10 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

15 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

20 Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
25 65 70 75 80 



30 



Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 HO 

35 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

40 Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
45 145 150 155 160 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

50 

Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 

55 

<210> 65 
<211> 561 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xylanase BD6002B 



<220> 

<221> CDS 



48 



10 <222> (1) . . (561) 
<400> 65 

atg gcc tec acc gac tac tgg cag aac tgg acc gac ggc ggc ggc acc 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 

15 1 5 10 15 

gtg aac gcc acc aac ggc tec gac ggc aac tac tec gtg tec tgg tec 96 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

20 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc tec gcc 144 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

25 acc cgc gtg ate aac tac aac gcc ggc gcc ttc tec ccg tec ggc aac 192 

Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

ggc tac etc gcc etc tac ggc tgg acc cgc aac tec etc ate gag tac 240 

30 Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 

65 70 75 80 

tac gtg gtg gac tec tgg ggc acc tac cgc ccg acc ggc acc tac aag 288 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

35 85 90 95 

ggc acc gtg acc tec gac ggc ggc acc tac gac ate tac acc acc acc 336 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 110 

40 

cgc acc aac gcc ccg tec ate gac ggc aac aac acc acc ttc acc cag 384 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

45 ttc tgg tec gtg cgc cag tec aag cgc ccg ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc tec aac cac gtg aac gcc tgg aag tec aag ggc atg aac 480 

50 He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 150 155 160 

etc ggc tec tec tgg tec tac cag gtg etc gcc acc gag ggc tac cag 528 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

55 165 170 175 

tec tec ggc tac tec aac gtg acc gtg tgg tga 561 
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Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



5 <210> 66 

<211> 186 

<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Synthetic Construct 

<400> 66 

15 Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 
20 20 25 30 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

25 

Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

30 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

35 Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 
40 100 105 110 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

45 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

50 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

55 Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 
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Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 



<210> 67 

<211> 2071 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> misc_feature 
<222> (1) . - (2071) 
<223> Promoter 

<400> 67 

tccatgctgt cctactactt gcttcatccc cttctacatt ttgttctggt ttttggcctg 60 

catttcggat catgatgtat gtgatttcca atctgctgca atatgaatgg agactctgtg 120 

ctaaccatca acaacatgaa atgcttatga ggcctttgct gagcagccaa tcttgcctgt 180 

gtttatgtct tcacaggccg aattcctctg ttttgttttt caccctcaat atttggaaac 240 

atttatctag gttgtttgtg tccaggccta taaatcatac atgatgttgt cgtattggat 300 

gtgaatgtgg tggcgtgttc agtgccttgg att tgagttt gatgagagtt gcttctgggt 360 

caccactcac cattatcgat gctcctcttc agcataaggt aaaagtcttc cctgtttacg 420 

ttattttacc cactatggtt gcttgggttg gttttttcct gattgcttat gccatggaaa 480 

gtcatttgat atgttgaact tgaattaact gtagaattgt atacatgttc catttgtgtt 540 

gtacttcctt cttttctatt agtagcctca gatgagtgtg aaaaaaacag attatataac 600 

ttgccctata aatcatttga aaaaaatatt gtacagtgag aaattgatat atagtgaatt 660 

tttaagagca tgttttccta aagaagtata tattttctat gtacaaaggc cattgaagta 720 

attgtagata caggataatg tagacttttt ggacttacac tgctaccttt aagtaacaat 780 

catgagcaat agtgttgcaa tgatatttag gctgcattcg tttactctct tgatttccat 840 

gagcacgctt cccaaactgt taaactctgt gttttttgcc aaaaaaaaat gcataggaaa 900 

gttgctttta aaaaatcata tcaatccatt ttttaagtta tagctaatac ttaattaatc 960 

atgcgctaat aagtcactct gtttttcgta ctagagagat tgttttgaac cagcactcaa 1020 

gaacacagcc ttaacccagc caaataatgc tacaacctac cagtccacac ctcttgtaaa 1080 

gcatttgttg catggaaaag ctaagatgac agcaacctgt tcaggaaaac aactgacaag 1140 

gtcataggga gagggagctt ttggaaaggt gccgtgcagt tcaaacaatt agttagcagt 1200 
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agggtgttgg 


tttttgetea 


cagcaataag 


aagttaatca 


tggtgtaggc 


aacccaaata 


1260 




aaacaccaaa 


atatgeacaa 


ggcagtttgt 


tgtattctgt 


agtacagaca 


aaactaaaag 


1320 


5 


taatgaaaga 


agatgtggtg 


ttagaaaagg 


aaacaatatc 


atgagtaatg 


tgtgggcatt 


1380 




atgggaccac 


gaaataaaaa 


gaacattttg 


atgagtcgtg 


tatcctcgat 


gagectcaaa 


1440 


10 


agttctctca 


ccccggataa 


gaaaccctta 


agcaatgtgc 


aaagtttgca 


ttctccactg 


1500 


acataatgca 


aaataagata 


tcatcgatga 


catagcaact 


catgeatcat 


atcatgcctc 


1560 




tctcaaccta 


ttcattccta 


ctcatctaca 


taagtatctt 


cagctaaatg 


ttagaacata 


1620 


15 


aacccataag 


teaegtt tga 


tgagtattag 


gcgtgacaca 


tgacaaatca 


cagactcaag 


1680 




caagataaag 


caaaatgatg 


tgtacataaa 


actccagagc 


tatatgtcat 


at tgcaaaaa 


1740 


20 


gaggagagct 
agaggagggc 


tataagacaa 
tttacattat 


ggcatgactc 
ccatgtcata 


acaaaaattc 
t tgcaaaaga 


atttgeett t 
aagagagaaa 


cgtgtcaaaa 
gaacaacaca 


1800 
1860 




atgctgcgtc 


aat tatacat 


atctgtatgt 


ccatcat ta t 


tcatccacct 


ttcgtgtacc 


1920 


25 


acacttcata 


tatcatgagt 


cacttcatgt 


ctggacatta 


acaaactcta 


tcttaacat t 


1980 




tagatgcaag 


agectttate 


tcactataaa 


tgcacgatga 


tttctcattg 


tttctcacaa 


2040 


30 


aaagcat tea 


gttcattagt 


cctacaacaa 


c 






2071 


35 


<210> 68 
<211> 79 
<212> PRT 
<213> Zea 


mays 













<220> 

<221> SIGNAL 

40 <222> (1) . . (79) 

<223> Maize waxy signal sequence. 

<400> 68 

45 Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 

Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 
50 20 25 3 0 



Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 

55 

Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 
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50 55 60 



Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala 
65 " 70 75 



<210> 69 

<211> 1005 

10 <212> DNA 

<213> Artificial Sequence 



15 



<220 > 

<223> Synthetic Bromelain Sequence 



<220> 

<221> CDS 



<222> (1) . . (1005) 
20 <223> Synthetic Bromelain 

<400> 69 

atg gcc tgg aag gtg cag gtg gtg ttc etc ttc etc ttc etc tgc gtg 48 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
25 15 10 15 

atg tgg 9 CC tec ccg tec gcc gcc tec gcg gac gag ccg tec gac ccg 96 

Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 

30 

atg atg aag cgc ttc gag gag tgg atg gtg gag tac ggc cgc gtg tac 144 

Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 

35 aag gac aac gac gag aag atg cgc cgc ttc cag ate ttc aag aac aac 192 

Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 
50 55 60 

gtg aac cac ate gag acc ttc aac tec cgc aac gag aac tec tac acc 240 

40 Val Asn His He Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 

etc ggc ate aac cag ttc acc gac atg acc aac aac gag ttc ate gcc 288 

Leu Gly He Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe He Ala 

45 85 90 95 

cag tac acc ggc ggc ate tec cgc ccg etc aac ate gag cgc gag ccg 336 

Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn He Glu Arg Glu Pro 

100 105 HO 



50 

gtg gtg tec ttc gac gac gtg gac ate tec gcc gtg ccg cag tec ate 384 

Val Val Ser Phe Asp Asp Val Asp He Ser Ala Val Pro Gin Ser He 

115 120 125 

55 gac tgg cgc gac tac ggc gcc gtg acc tec gtg aag aac cag aac ccg 432 

Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 

130 135 140 
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tgc ggc gcc 
Cys Gly Ala 
145 

tac aag ate 
Tyr Lys lie 



etc gac tgc 
Leu Asp Cys 



gcc ttc gag 
Ala Phe Glu 
195 

tac ccg tac 
Tyr Pro Tyr 
210 

aac tec gcc 
Asn Ser Ala 
225 

tec tec atg 
Ser Ser Met 



gac gcc aac 
Asp Ala Asn 



ccg tgc ggc 
Pro Cys Gly 
275 

cag gac tec 
Gin Asp Ser 
290 

gcc ggc tac 
Ala Gly Tyr 
305 

tgc ggc ate 
Cys Gly lie 



tgc tgg gcc 
Cys Trp Ala 
150 

aag aag ggc 
Lys Lys Gly 
165 

gcc aag ggc 
Ala Lys Gly 
180 

ttc ate ate 
Phe lie lie 



aag gcc gcc 
Lys Ala Ala 



tac ate ace 
Tyr lie Thr 
230 

atg tac gcc 
Met Tyr Ala 
245 

gcc aac ttc 
Ala Asn Phe 
260 

ace tec etc 
Thr Ser Leu 



ate ate tac 
lie lie Tyr 



ate cgc atg 
lie Arg Met 
310 

gcc ate gac 
Ala lie Asp 
325 



ttc gcc gcc 
Phe Ala Ala 



ate etc gag 
lie Leu Glu 



tac ggc tgc 
Tyr Gly Cys 
185 

tec aac aag 
Ser Asn Lys 
200 

aag ggc ace 
Lys Gly Thr 
215 

ggc tac gcc 
Gly Tyr Ala 



gtg tec aag 
Val Ser Lys 



cag tac tac 
Gin Tyr Tyr 
265 

aac cac gcc 
Asn His Ala 
280 

ccg aag aag 
Pro Lys Lys 
295 

gcc cgc gac 
Ala Arg Asp 



ccg etc tac 
Pro Leu Tyr 



ate gcc acc 
lie Ala Thr 
155 

ccg etc tec 
Pro Leu Ser 
170 

aag ggc ggc 
Lys Gly Gly 



ggc gtg gcc 
Gly Val Ala 



tgc aag acc 
Cys Lys Thr 
220 

cgc gtg ccg 
Arg Val Pro 
235 

cag ccg ate 
Gin Pro lie 
250 

aag tec ggc 
Lys Ser Gly 



gtg acc gcc 
Val Thr Ala 



tgg ggc gcc 

Trp Gly Ala 
300 

gtg tec tec 
Val Ser Ser 
315 

ccg acc etc 
Pro Thr Leu 
330 



gtg gag tec 
Val Glu Ser 



gag cag cag 
Glu Gin Gin 
175 

tgg gag ttc 

Trp Glu Phe 
190 

tec ggc gcc 
Ser Gly Ala 
205 

gac ggc gtg 
Asp Gly Val 



cgc aac aac 
Arg Asn Asn 



acc gtg gcc 
Thr Val Ala 
255 

gtg ttc aac 
Val Phe Asn 
270 

ate ggc tac 
lie Gly Tyr 
285 

aag tgg ggc 
Lys Trp Gly 



tec tec ggc 
Ser Ser Gly 



gag gag tag 
Glu Glu 



ate 480 

He 

160 

gtg 528 
Val 



cgc 576 
Arg 



ate 624 
He 



ccg 672 
Pro 



gag 72 0 

Glu 

240 

gtg 768 
Val 



ggc 816 
Gly 



ggc 864 
Gly 



gag 912 
Glu 



ate 960 

He 

320 

1005 



<210> 70 

<211> 334 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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<400> 70 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
5 1 5 10 15 



Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 

10 

Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 

15 

Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 
50 55 60 



20 Val Asn His lie Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 



Leu Gly He Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe He Ala 
25 85 90 95 



Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn He Glu Arg Glu Pro 

100 105 110 

30 

Val Val Ser Phe Asp Asp Val Asp He Ser Ala Val Pro Gin Ser He 
115 120 125 

35 

Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 13 5 14 0 



40 Cys Gly Ala Cys Trp Ala Phe Ala Ala He Ala Thr Val Glu Ser He 
145 150 155 160 



Tyr Lys He Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 
45 * ' 165 170 175 



Leu Asp Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 185 190 

50 

Ala Phe Glu Phe lie lie Ser Asn Lys Gly Val Ala Ser Gly Ala lie 
195 200 205 

55 

Tyr Pro Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 
210 215 220 
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Asn Ser Ala Tyr lie Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 
225 230 235 240 



Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro lie Thr Val Ala Val 

245 250 255 



Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 



Pro Cys Gly Thr Ser Leu Asn His Ala Val Thr Ala He Gly Tyr Gly 
275 280 285 



Gin Asp Ser He He Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
290 295 300 



Ala Gly Tyr He Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly He 
305 ~ 310 315 320 



Cys Gly He Ala He Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

32 5 330 



<210> 


71 


<211> 


78 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Bromealin signal sequence 


<400> 


71 



atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tctgcgtgat gtgggcctcc 60 
ccgtccgccg cctccgcc 78 



<210> 


72 


<211> 


26 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Bromealin signal peptide 


<400> 


72 



Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
15 10 15 
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10 



Met Trp Ala Ser Pro Ser Ala Ala Ser Ala 

20 25 



<210> 73 

<211> 1050 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> pSYNHOOO 

<400> 73 



15 


atggcctgga 


aggtgcaggt 


ggtgttcctc 


ttcctcttcc 


tctgcgtgat 


gtgggcctcc 


60 




ccgtccgccg 


cctccgcgga 


cgagccgtcc 


gacccgatga 


tgaagcgctt 


cgaggagtgg 


120 


20 


atggtggagt 


acggccgcgt 


gtacaaggac 


aacgacgaga 


agatgcgccg 


ct tccagatc 


180 


ttcaagaaca 


acgtgaacca 


catcgagacc 


ttcaactccc 


gcaacgagaa 


ctcctacacc 


240 




ctcggcatca 


accagttcac 


cgacatgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


300 


25 


ggcatctccc 


gcccgctcaa 


catcgagcgc 


gagccggtgg 


tgtccttcga 


cgacgtggac 


360 




atctccgccg 


tgccgcagtc 


catcgactgg 


cgcgactacg 


gcgccgtgac 


ctccgtgaag 


420 


30 


aaccagaacc 


cgtgcggcgc 


ctgctgggcc 


ttcgccgcca 


tcgccaccgt 


ggagtccatc 


480 


tacaagatca 


agaagggcat 


cctcgagccg 


ctctccgagc 


agcaggtgct 


cgactgcgcc 


540 




aagggctacg 


gctgcaaggg 


cggctgggag 


ttccgcgcct 


tcgagttcat 


catctccaac 


600 


35 


aagggcgtgg 


cctccggcgc 


catctacccg 


tacaaggccg 


ccaagggcac 


ctgcaagacc 


660 




gacggcgtgc 


cgaactccgc 


ctacatcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 


40 


tcctccatga 


tgtacgccgt 


gtccaagcag 


ccgatcaccg 


tggccgtgga 


cgccaacgcc 


780 


aacttccagt 


actacaagtc 


cggcgtgttc 


aacggcccgt 


gcggcacctc 


cctcaaccac 


840 




gccgtgaccg 


ccatcggcta 


cggccaggac 


tccatcatct 


acccgaagaa 


gtggggcgcc 


900 


45 


aagtggggcg 


aggccggcta 


catccgcatg 


gcccgcgacg 


tgtcctcctc 


ctccggcatc 


960 




tgcggcatcg 


ccatcgaccc 


gctctacccg 


accctcgagg 


aggtgttcgc 


cgaggccatc 


1020 




gccgccaact 


ccaccctcgt 


ggccgagtag 








1050 



50 



<210> 74 

<211> 1067 

<212> DNA 

55 <213> Artificial Sequence 

<220> 



244 



WO 2005/096804 PCT/US2004/007182 
<223> pSYN11589 



<400> 74 

tggcctggaa 


ggtgcaggtg 


gtgttcctct 


tcctcttcct 


ctgcgtgatg 


tgggcctccc 


60 


cgtccgccgc 


ctccgcctcc 


tcctcctcct 


tcgccgactc 


caacccgatc 


cgcccggtga 


120 


ccgaccgcgc 


cgcctccacc 


gaegagcegt 


ccgacccgat 


gatgaagege 


ttcgaggagt 


180 


ggatggtgga 


gtacggccgc 


gtgtacaagg 


acaacgacga 


gaagatgege 


cgcttccaga 


240 


tcttcaagaa 


caacgtgaac 


cacatcgaga 


ccttcaactc 


ccgcaacgag 


aactcctaca 


300 


ccctcggcat 


caaccagttc 


accgacatga 


ccaacaacga 


gttcatcgcc 


cagtacaccg 


360 


gcggcatctc 


ccgcccgctc 


aacatcgagc 


gcgagccggt 


ggtgtccttc 


gacgacgtgg 


420 


acatctccgc 


cgtgccgcag 


tccatcgact 


ggegegacta 


cggcgccgtg 


acctccgtga 


480 


agaaccagaa 


cccgtgcggc 


gcctgctggg 


ccttcgccgc 


catcgccacc 


gtggagtcca 


540 


tctacaagat 


caagaagggc 


atcctcgagc 


cgctctccga 


gcagcaggtg 


ctcgactgcg 


600 


ccaagggcta 


cggctgcaag 


ggcggctggg 


agttccgcgc 


cttcgagttc 


atcatctcca 


660 


acaagggcgt 


ggcctccggc 


gccatctacc 


egtacaagge 


cgccaagggc 


acctgeaaga 


720 


ccgacggcgt 


gccgaac tec 


gcctacatca 


ccggctacgc 


ccgcgtgccg 


cgcaacaacg 


780 






atatccaaQC 


agccgatcac 


cgtggccgtg 


gacgccaacg 


840 


ccaacttcca 


gtactacaag 


tccggcgtgt 


tcaacggccc 


gtgcggcacc 


tccctcaacc 


900 


acgccgtgac 


cgccatcggc 


taeggecagg 


actccatcat 


ctacccgaag 


aagtggggcg 


960 


ccaagtgggg 


cgaggccggc 


tacatccgca 


tggcccgcga 


cgtgtcctcc 


tcctccggca 


1020 


tctgcggcat 


cgccatcgac 


ccgctctacc 


cgaccctcga 


ggagtag 




1067 



<210> 75 
<211> 1023 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN11587 Sequence 
<400> 75 

atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tetgegtgat gtgggcctcc 60 
ccgtccgccg cctccgcgga cgagccgtcc gacccgatga tgaagegett cgaggagtgg 120 
atggtggagt acggccgcgt gtacaaggac aacgacgaga agatgegecg cttccagatc 180 
ttcaagaaca aegtgaacca catcgagacc ttcaactccc gcaacgagaa ctcctacacc 240 



245 
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ctcggcatca 


accagttcac 


cgacatgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


300 




ggcatctccc 


gcccgctcaa 


catcgagcgc 


gagccggtgg 


tgtccttcga 


cgacgtggac 


360 


5 


atctccgccg 


tgccgcagtc 


catcgactgg 


cgcgactacg 


gcgccgtgac 


ctccgtgaag 


420 




aaccagaacc 


cgtgcggcgc 


ctgctgggcc 


ttcgccgcca 


tcgccaccgt 


ggagtccatc 


480 


10 


tacaagatca 


agaagggcat 


cctcgagccg 


ctctccgagc 


ageaggtget 


cgactgcgcc 


540 


aagggctacg 


getgeaaggg 


eggctgggag 


ttccgcgcct 


tcgagt teat 


catctccaac 


600 




aagggcgtgg 


cctccggcgc 


catctacccg 


tacaaggecg 


ccaagggcac 


ctgcaagacc 


660 


15 


gacggcgtgc 


cgaactccgc 


ctacatcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 




tec tccatga 


tgtacgccgt 


gtccaagcag 


ccgatcaccg 


tggccgtgga 


cgccaacgcc 


780 


20 


aact tccagt 


actacaagtc 


cggcgtgttc 


aacggcccgt 


gcggcacctc 


cctcaaccac 


840 


gccgtgaccg 


ccatcggcta 


cggccaggac 


tccatcatct 


acccgaagaa 


gtggggcgcc 


900 




aagtggggcg 


aggceggcta 


catccgcatg 


gcccgcgacg 


tgtcctcctc 


ctccggcatc 


960 


25 


tgeggcateg 
tag 


ccatcgaccc 


gctctacccg 


accctcgagg 


agtccgagaa 


ggacgagctg 


1020 
1023 


30 


<210> 76 
<211> 990 
<212> DNA 

<213> Artificial Sequence 










35 


<220> 

<223> pSYN12169 Sequence 










40 


<400> 76 
atgagggtgt 


tgctcgttgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 


geggacgage 


cgtccgaccc 


gatgatgaag 


cgcttcgagg 


agtggatggt 


ggagtaegge 


120 




cgcgtgtaca 


aggacaacga 


cgagaagatg 


cgccgct tec 


agatcttcaa 


gaacaacgtg 


180 


45 


aaccacatcg 


agaccttcaa 


ctcccgcaac 


gagaactcct 


acaccctcgg 


catcaaccag 


240 




ttcaccgaca 


tgaccaacaa 


cgagttcatc 


gcccagtaca 


ccggcggcat 


ctcccgcccg 


300 


50 


ctcaacatcg 


agcgcgagcc 


ggtggtgtcc 


ttcgacgacg 


tggacatctc 


cgccgtgccg 


360 


cagtccatcg 


actggegega 


ctacggcgcc 


gtgacctccg 


tgaagaacca 


gaacccgtgc 


420 




ggcgcctgct 


gggccttcgc 


cgccatcgcc 


accgtggagt 


ccatctacaa 


gatcaagaag 


480 


55 


ggcatcctcg 


agccgctctc 


cgagcagcag 


gtgetcgact 


gcgccaaggg 


ctacggctgc 


540 




aagggegget 


gggagttccg 


cgccttcgag 


ttcatcatct 


ccaacaaggg 


cgtggcctcc 


600 
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ggcgccatct 


acccgtacaa 


ggccgccaag 


5 


tccgcctaca 


tcaccggcta 


cgcccgcgtg 




gccgtgtcca 


agcagccgat 


caccgtggcc 




aaqtccqqcq 


tgttcaacgg 


cccgtgcggc 


10 


ggctacggcc 


aggactccat 


catctacccg 




ggctacatcc 


gcatggcccg 


cgacgtgtcc 


15 


gacccgctct 


acccgaccct 


cgaggagtag 


20 


<210> 77 
<211> 1170 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN12575 Sequence 


25 


<400> 77 
atgctggcgg 


ctctggccac 


gtcgcagctc 




gacgcgtcca 


cgttccgccg 


cggcgccgcg 


30 


gcggcggaca 


cgctcagcat 


gcggaccagc 




caggcgcgcc 


gcggggccag 


gttcccgtcg 


35 


gcggacgagc 
cgcgtgtaca 


cgtccgaccc 
aggacaacga 


gatgatgaag 
cgagaagatg 




aaccacatcg 


agaccttcaa 


ctcccgcaac 


40 


ttcaccgaca 


tgaccaacaa 


cgagttcatc 




ctcaacatcg 


agcgcgagcc 


ggtggtgtcc 


45 


cagtccatcg 
ggcgcctgct 


actggcgcga 
gggccttcgc 


ctacggcgcc 
cgccatcgcc 




ggcatcctcg 


agccgctctc 


cgagcagcag 


50 


aagggcggct 


gggagttccg 


cgccttcgag 




ggcgccatct 


acccgtacaa 


ggccgccaag 


55 


tccgcctaca 
gccgtgtcca 


tcaccggcta 
agcagccgat 


cgcccgcgtg 
caccgtggcc 



PCT/US2004/007182 



ggcacctgca 


agaccgaegg 


cgcgccgaac 


t~ cz r\ 


ccgcgcaaca 


acgagt cccc 


cat.yaugL.ac 


/ £. U 


gtggacgcca 


acgccaactt 


ccagtactac 


'■j 0 r\ 

7 8 0 


acctccctca 


accacgccgt 


gaccgccatc 


840 


aagaagtggg 


gcgccaagtg 


gggegaggee 


900 


tcctcctccg 


geatctgegg 


catcgccatc 


960 








990 


gtcgcaacgc 


gcgccggcct 


999 c 9 tccc 9 


0 u 


cagggcctga 


9g9999 ccc 9 


ggcgtcggcg 


1 ^ u 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


T Q A 
1 O U 


ctcgtcgtgt 


gcgccagcgc 


cggcgccatg 


O /I A 

24 0 


cgcttcgagg 


agtggatggt 


ggagtaegge 


T A A 


cgccgc t tec 


- - _|H 1 4** 4b b»*i B"B» 

agaccttcaa 


***** «•> fBB. ^B MB> 4— < M 

gaacaacgcg 


"J ^ A 


gagaac tcct 


acaccctcgg 


A 4* ^BB A M *«\ 

catcaaccag 


4 O A 


gcccagtaca 


ccggcggcau 


ctcccgcccg 


4 Oft 


4-^ 4 I J » >B — M <■ L >■ M *V ^ B J M 

t t cgacgacg 


4— J— -V y—B. — » /— B — s 4V- 4— — B. 

tggacatctc 


— . ---> /-^r g^t /*4 /^f 

Cyccgcyccg 


DriU 


gtgacc tccg 


4b* _r~_f rfB^ bm bk 

tgaagaacca 


gaacccgcgc 




accgtggagc 




gaLCaayciay 


£ n 

D D U 


gcgcccgaci- 


gcgccaaggg 






ttcatcatct 


ccaacaaggg 


cgtggcctcc 


780 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


840 


ccgcgcaaca 


acgagtcctc 


catgatgtac 


900 


gtggacgcca 


acgccaactt 


ccagtactac 


960 
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aagtccggcg 


tgttcaacgg 


cccgtgcggc 


acctccctca 


accacgccgt 


gaccgccatc 


1020 


ggctacggcc 


aggactccat 


catctacccg 


aagaagtggg 


gcgccaagtg 


gggegaggee 


1080 


ggctacatcc 


gcatggcccg 


cgacgtgtcc 


tcctcctccg 


geatctgegg 


catcgccatc 


1140 


gacccgctct 


acccgaccct 


cgaggagtag 








1170 


<210> 78 
<211> 1068 
<212> DNA 

<213> Artificial Sequence 






• 




<220> 

<223> pSM270 Sequence 










<400> 78 

r» t~~ /-* n r~* c~* \~ nns 

at-ygcLLyyd 


«yy L y ^ a yy *- 


aatattcctc 


ttcctcttcc 


tetgegtgat 


gtgggcctcc 


60 


v<^y u. y ^- y 


r*ctccocctc 

w *^-» \— Nw« 


ctcct cc tec 


ttcgccgact 


ccaacccgat 


ccgcccggtg 


120 


aLuya^L.y^.y 


ccaccticcsLC 

V.* >*>H >w> # V-» 


cnacqaQCCQ 


tccgacccga 


tgatgaagcg 


ct tcgaggag 


180 


L yy dL yy L yy 




catqtacaaq 


gacaacgacg 


agaagatgeg 


ccgcttccag 


240 


a t~ c t~ t*caaaa 

CA V— • l»» 


acaacataaa 


ccacatcqaq 


acct tcaact 


cccgcaacga 


gaactcctac 


300 


accc tcQQca 


tcaaccaqtt 


caccgacatg 


accaacaacg 


agttcatcgc 


ccagtacacc 


360 


ggcggcatct 


cccgcccgct 


caacatcgag 


cgcgagccgg 


tggtgtcctt 


egacgaegtg 


420 


gacatctccg 


ccgtgccgca 


gtccatcgac 


tggegegact 


acggcgccgt 


gacctccgtg 


480 


aagaaccaga 


acccgtgcgg 


cgcctgctgg 


gccttcgccg 


ccatcgccac 


cgtggagtcc 


540 


atctacaaga 


tcaagaaggg 


catcctcgag 


ccgctctccg 


agcagcaggt 


gctcgactgc 


600 


gccaagggct 


acggctgcaa 


gggcggctgg 


gagttccgcg 


ccttcgagtt 


catcatctcc 


660 


aacaagggcg 


tggcctccgg 


cgccatctac 


cegtacaagg 


ccgccaaggg 


cacctgcaag 


720 


accgacggcg 


tgccgaactc 


cgcctacatc 


accggctacg 


cccgcgtgcc 


gcgcaacaac 


780 


gagtcctcca 


tgatgtacgc 


cgtgtccaag 


cagccgatca 


ccgtggccgt 


ggacgccaac 


840 


gccaacttcc 


agtactacaa 


gtccggcgtg 


ttcaacggcc 


cgtgcggcac 


ctccctcaac 


900 


cacgccgtga 


ccgccatcgg 


ctacggccag 


gactccatca 


tctacccgaa 


gaagtggggc 


960 


gccaagtggg 


gcgaggccgg 


ctacatccgc 


atggcccgcg 


acgtgtcctc 


ctcctccggc 


1020 


atctgcggca 


tcgccatcga 


cccgctctac 


ccgaccctcg 


aggagtag 




1068 



<210> 79 
<211> 1497 



248 
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<212> DNA 

<213> Trichoderma reesei 



30 



acg 



48 



5 <220> 

<221> CDS 
<222> (1) . . (1497) 

<223> Trichoderma reesei eel lobiohyrodlase I 
10 <400> 79 

atg cag teg gcg tgt act etc caa teg gag act cac ccg cct ctg aca 
Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
15 10 15 

15 tgg cag aaa tgc teg tct ggt ggc acg tgc act caa cag aca ggc tec 96 
Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 

gtg gtc ate gac gee aac tgg cgc tgg act cac get acg aac age age 144 
20 Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 

35 40 45 



aac tgc tac gat ggc aac act tgg age teg ace eta tgt cct gac 192 
Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
25 50 55 60 



aac gag ace tgc gcg aag aac tgc tgt ctg gac ggt gee gee tac gcg 240 

Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 

65 70 75 80 

tec acg tac gga gtt ace acg age ggt aac age etc tec att ggc ttt 288 

Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

85 90 95 

35 gtc ace cag tct gcg cag aag aac gtt ggc get cgc ctt tac ctt atg 336 

Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

100 105 110 

gcg age gac acg ace tac cag gaa ttc acc ctg ctt ggc aac gag ttc 384 

40 Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 

115 120 125 

tct ttc gat gtt gat gtt teg cag ctg ccg tgc ggc ttg aac gga get 432 

Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 

45 130 135 140 

etc tac ttc gtg tec atg gac gcg gat ggt ggc gtg age aag tat ccc 480 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 

145 150 155 160 

50 

acc aac acc get ggc gee aag tac ggc acg ggg tac tgt gac age cag 528 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 

55 tgt ccc cgc gat ctg aag ttc ate aat ggc cag gee aac gtt gag ggc 576 

Cys Pro Arg Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

180 185 190 
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tgg gag ccg tea tec aac aac gcg aac acg ggc att gga gga cac gga 624 

Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 

195 200 205 

5 

age tgc tgc tct gag atg gat ate tgg gag gee aac tec ate tec gag 672 

Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser He Ser Glu 

210 215 220 

10 get ctt acc ccc cac cct tgc acg act gtc ggc cag gag ate tgc gag 720 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu 

225 230 235 240 

ggt gat ggg tgc ggc gga act tac tec gat aac aga tat ggc ggc act 768 

15 Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 

tgc gat ccc gat ggc tgc gac tgg aac cca tac cgc ctg ggc aac acc 816 

Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 

20 260 265 270 

age ttc tac ggc cct ggc tct age ttt acc etc gat acc acc aag aaa 864 

Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 

275 280 285 

25 

ttg acc gtt gtc acc cag ttc gag acg teg ggt gec ate aac cga tac 912 

Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala He Asn Arg Tyr 

290 295 300 

30 tat gtc cag aat ggc gtc act ttc cag cag ccc aac gee gag ctt ggt 960 

Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 

305 310 315 320 

agt tac tct ggc aac gag etc aac gat gat tac tgc aca get gag gag 1008 

35 Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 

gca gaa ttc ggc gga tec tct ttc tea gac aag ggc ggc ctg act cag 1056 

Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 

40 340 345 350 

ttc aag aag get acc tct ggc ggc atg gtt ctg gtc atg agt ctg tgg 1104 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 

355 360 365 

gat gat tac tac gee aac atg ctg tgg ctg gac tec acc tac ccg aca 1152 

Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 

370 375 380 

50 aac gag acc tec tec aca ccc ggt gee gtg cgc gga age tgc tec acc 1200 

Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 

385 390 395 400 



45 



age 



tec ggt gtc cct get cag gtc gaa tct cag tct ccc aac gee aag 1248 
55 Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 

405 410 415 
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gtc acc ttc tec aac ate aag ttc gga ccc att ggc age ace ggc aac 
Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 

420 425 430 



1296 



cct age ggc ggc aac cct ccc ggc gga aac ccg cct ggc acc acc acc 1344 
Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 440 445 



acc cgc cgc cca gec act acc act gga age tct ccc gga cct acc cag 
Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
450 455 460 



tgc gee age ggc aca act tgc cag gtc ctg aac cct tac tac tct cag 
Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 

485 490 495 

tgc ctg taa 
Cys Leu 



<210> 80 
<211> 498 
<212> PRT 

<213> Trichoderma reesei 
<400> 80 

Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
15 10 15 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

2 0 2 5 3 0 



Val Val lie Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
35 40 45 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
50 55 60 



Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
65 70 75 80 



Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 

85 90 95 



Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 



1392 



tct cac tac ggc cag tgc ggc ggt att ggc tac age ggc ccc acg gtc 1440 
Ser His Tyr Gly Gin Cys Gly Gly He Gly Tyr Ser Gly Pro Thr Val 
465 " 470 475 480 



1488 



1497 
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100 105 HO 



Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
5 115 120 125 

Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
130 135 140 

10 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
145 150 155 160 

15 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 



20 Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 

180 185 190 



Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
25 195 200 205 

Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
210 215 220 

30 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu 
225 230 235 240 

35 

Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 



40 Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 

260 265 270 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
45 275 280 285 



Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala He Asn Arg Tyr 
290 295 300 

50 

Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
305 310 315 320 

55 

Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 
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Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 

340 345 350 

5 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
355 360 365 

10 

Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
370 375 380 



15 Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
385 390 395 400 

Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 
20 405 410 415 

Val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 

420 425 430 

25 

Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 440 445 

30 

Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
450 455 460 

4 

35 Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 
465 470 475 480 

Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
40 485 490 495 



Cys Leu 

45 

<210> 81 
<211> 1365 
<212> DNA 
50 <213> Trichoderma reesei 



<220> 

<221> CDS 

55 <222> (1) . . (1365) 

<223> trichoderma reesei cellobiohydrolase II 
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25 



<400> 81 

atg gtg cct eta gag gag egg caa get tgc tea age gtc tgg ggc caa 4 8 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
15 10 15 

5 

tgt ggt ggc cag aat tgg teg ggt ccg act tgc tgt get tec gga age 96 
Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

10 aca tgc gtc tac tec aac gac tat tac tec cag tgt ctt ccc ggc get 144 
Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 
35 40 45 

gca age tea age teg tec acg cgc gee gcg teg acg act tea cga gta 192 
15 Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
50 55 60 

tec ccc aca aca tec egg teg age tec gcg acg cct cca cct ggt tct 240 
Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 
20 65 70 75 80 

acc act acc aga gta cct cca gtc gga teg gga acc get acg tat tea 288 
Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 95 

ggc aac cct ttt gtt ggg gtc act cct tgg gee aat gca tat tac gee 336 
Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 

30 tct gaa gtt age age etc get att cct age ttg act gga gee atg gee 384 
Ser Glu Val Ser Ser Leu Ala lie Pro Ser Leu Thr Gly Ala Met Ala 
115 120 125 

act get gca gca get gtc gca aag gtt ccc tct ttt atg tgg eta gat 432 
35 Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
130 135 140 

act ctt gac aag acc cct etc atg gag caa acc ttg gee gac ate cgc 480 
Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 
40 145 150 155 160 

acc gee aac aag aat ggc ggt aac tat gee gga cag ttt gtg gtg tat 528 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 175 

45 

gac ttg ccg gat cgc gat tgc get gee ctt gee teg aat ggc gaa tac 576 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 190 

50 tct att gee gat ggt ggc gtc gee aaa tat aag aac tat ate gac acc 624 
Ser He Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr He Asp Thr 
195 200 205 

att cgt caa att gtc gtg gaa tat tec gat ate egg acc etc ctg gtt 672 
55 He Arg Gin lie Val Val Glu Tyr Ser Asp He Arg Thr Leu Leu Val 
210 215 220 
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att gag cct 
lie Glu Pro 
225 

aag tgt gcc 
Lys Cys Ala 



gtc aca cag 
Val Thr Gin 



cat gca gga 
His Ala Gly 
275 

eta ttt gca 
Leu Phe Ala 
290 

gga ttg gca 
Gly Leu Ala 
305 

ccc cca teg 
Pro Pro Ser 



ate cac get 
lie His Ala 



ttc ttc ate 
Phe Phe lie 
355 

caa cag tgg 
Gin Gin Trp 
370 

cgc cca tec 
Arg Pro Ser 
385 

gtc aag cca 
Val Lys Pro 



cga ttt gac 
Arg Phe Asp 



gac tct ctt 
Asp Ser Leu 
230 

aat get cag 
Asn Ala Gin 
245 

ctg aac ctt 
Leu Asn Leu 
260 

tgg ctt ggc 
Trp Leu Gly 



aat gtt tac 
Asn Val Tyr 



ace aat gtc 
Thr Asn Val 
310 

tac acg caa 
Tyr Thr Gin 
325 

att gga cct 
lie Gly Pro 
340 

act gat caa 
Thr Asp Gin 



gga gac tgg 
Gly Asp Trp 



gca aac act 
Ala Asn Thr 
390 

ggc ggc gag 
Gly Gly Glu 
405 

tec cac tgt 
Ser His Cys 
420 



gcc aac ctg 
Ala Asn Leu 



tea gcc tac 
Ser Ala Tyr 



cca aat gtt 
Pro Asn Val 
265 

tgg ccg gca 
Trp Pro Ala 
280 

aag aat gca 
Lys Asn Ala 
295 

gcc aac tac 
Ala Asn Tyr 



ggc aac get 
Gly Asn Ala 

ctt ctt gcc 
Leu Leu Ala 
345 

ggt cga teg 
Gly Arg Ser 
360 

tgc aat gtg 
Cys Asn Val 
375 

ggg gac teg 
Gly Asp Ser 



tgt gac ggc 
Cys Asp Gly 



gcg etc cca 
Ala Leu Pro 
425 



gtg ace aac 
Val Thr Asn 
235 

ctt gag tgc 
Leu Glu Cys 
250 

gcg atg tat 
Ala Met Tyr 



aac caa gac 
Asn Gin Asp 



teg tct ccg 
Ser Ser Pro 
300 

aac ggg tgg 
Asn Gly Trp 
315 

gtc tac aac 
Val Tyr Asn 
330 

aat cac ggc 
Asn His Gly 



gga aag cag 
Gly Lys Gin 



ate ggc ace 
lie Gly Thr 
380 

ttg ctg gat 
Leu Leu Asp 
395 

ace age gac 
Thr Ser Asp 
410 

gat gcc ttg 
Asp Ala Leu 



etc ggt act 
Leu Gly Thr 



ate aac tac 
lie Asn Tyr 
255 

ttg gac get 
Leu Asp Ala 
270 

ccg gcc get 
Pro Ala Ala 
285 

aga get ctt 
Arg Ala Leu 



aac att acc 
Asn lie Thr 



gag aag ctg 
Glu Lys Leu 
335 

tgg tec aac 
Trp Ser Asn 
350 

cct acc gga 
Pro Thr Gly 
365 

gga ttt ggt 
Gly Phe Gly 



teg ttt gtc 
Ser Phe Val 



age agt gcg 
Ser Ser Ala 
415 

caa ccg gcg 
Gin Pro Ala 
430 



cca 720 

Pro 

240 

gcc 768 
Ala 



ggc 816 
Gly 



cag 864 
Gin 



cgc 912 
Arg 

age 960 

Ser 

320 

tac 1008 
Tyr 



gcc 1056 
Ala 



cag 1104 
Gin 



att 1152 
He 



tgg 1200 

Trp 

400 

cca 1248 
Pro 



cct 1296 
Pro 



caa get ggt get tgg ttc caa gcc tac ttt gtg cag ctt etc aca aac 1344 
Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 440 445 

gca aac cca teg ttc ctg tag 1365 
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Ala Asn Pro Ser Phe Leu 
450 



5 <210> 82 
<211> 454 
<212> PRT 

<213> Trichoderma reesei 
10 <400> 82 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
15 10 15 

15 

Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

20 Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 

35 40 45 

Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
25 50 55 60 

Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 
65 70 75 80 

30 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 95 

35 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 



40 Ser Glu Val Ser Ser Leu Ala lie Pro Ser Leu Thr Gly Ala Met Ala 

115 120 125 



Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
45 130 135 140 

Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 
145 150 155 160 

50 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 175 

55 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 190 
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Ser He Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr He Asp Thr 
195 200 205 

5 

He Arg Gin He Val Val Glu Tyr Ser Asp He Arg Thr Leu Leu Val 
210 215 220 

10 ♦ 

He Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 

225 230 235 240 

15 Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys lie Asn Tyr Ala 

245 250 255 



Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 
20 260 265 270 

His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 
275 280 285 

25 

Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 
290 295 300 

30 

Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 
305 310 315 320 

35 Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 

He His Ala He Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 
40 340 345 350 

Phe Phe He Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 

45 

Gin Gin Trp Gly Asp Trp Cys Asn Val He Gly Thr Gly Phe Gly He 
370 375 380 

50 

Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 

55 Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

405 410 415 
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Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 425 430 



Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 440 445 



Ala Asn Pro Ser Phe Leu 
450 



<210> 83 

<211> 1317 

< 2 1 2 > DNA 

<213> Trichoderma reesei 



<220> 

<221> CDS 

<222> (1) . - (1317) 

<223> Trichoderma reesei endoglucanase I 
<400> 83 

atg cag caa ccg gga acc age acc ccc gag gtc cat ccc aag ttg aca 48 
Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
15 10 15 

acc tac aag tgc aca aag tec ggg ggg tgc gtg gec cag gac acc teg 96 
Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 

gtg gtc ctt gac tgg aac tac cgc tgg atg cac gac gca aac tac aac 144 
Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 40 45 

teg tgc acc gtc aac ggc ggc gtc aac acc acg etc tgc cct gac gag 192 
Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
50 55 60 

gcg acc tgt ggc aag aac tgc ttc ate gag ggc gtc gac tac gec gee 24 0 

Ala Thr Cys Gly Lys Asn Cys Phe lie Glu Gly Val Asp Tyr Ala Ala 
65 70 75 80 

teg ggc gtc acg acc teg ggc age age etc acc atg aac cag tac atg 288 
Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 95 

ccc age age tct ggc ggc tac age age gtc tct cct egg ctg tat etc 336 
Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 110 

ctg gac tct gac ggt gag tac gtg atg ctg aag etc aac ggc cag gag 384 
Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
115 120 125 
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ctg age ttc 
Leu Ser Phe 
130 

teg etc tac 
Ser Leu Tyr 
145 

aac acg gec 
Asn Thr Ala 



ccc gtc cag 
Pro Val Gin 



ttc tgc tgc 
Phe Cys Cys 
195 

gec ttg acc 
Ala Leu Thr 
210 

tgc ggc ttc 
Cys Gly Phe 
225 

gga gat acc 
Gly Asp Thr 



aac acg gac 
Asn Thr Asp 



aag tac cag 
Lys Tyr Gin 
275 

gac acc ate 
Asp Thr lie 
290 

acc atg ggc 
Thr Met Gly 
305 

tgg aac gac 
Trp Asn Asp 



ggc ccc tgc 
Gly Pro Cys 



aac ccc aac 



gac gtc gac 
Asp Val Asp 



ctg tct cag 
Leu Ser Gin 
150 

ggt gee aac 
Gly Ala Asn 
165 

aca tgg agg 
Thr Trp Arg 
180 

aac gag atg 
Asn Glu Met 



cct cac tct 
Pro His Ser 



aac ccc tat 
Asn Pro Tyr 
230 

gtt gac acc 
Val Asp Thr 
245 

aac ggc teg 
Asn Gly Ser 
260 

caa aac ggc 
Gin Asn Gly 



teg tec tgc 
Ser Ser Cys 



aag gec ctg 
Lys Ala Leu 
310 

aac age cag 
Asn Ser Gin 
325 

age age acc 
Ser Ser Thr 
340 

acg cac gtc 



etc tct get 
Leu Ser Ala 
135 

atg gac gag 
Met Asp Glu 



tac ggg age 
Tyr Gly Ser 



aac ggc acc 
Asn Gly Thr 
185 

gat ate ctg 
Asp lie Leu 
200 

tgc acg gec 
Cys Thr Ala 
215 

ggc age ggc 
Gly Ser Gly 



tec aag acc 
Ser Lys Thr 



ccc teg ggc 
Pro Ser Gly 
265 

gtc gac ate 
Val Asp lie 
280 

ccg tec gee 
Pro Ser Ala 
295 

age age ggc 
Ser Ser Gly 



tac atg aac 
Tyr Met Asn 



gag ggc aac 
Glu Gly Asn 
345 

gtc ttc tec 



ctg ccg tgt 
Leu Pro Cys 
140 

aac ggg ggc 
Asn Gly Gly 
155 

ggc tac tgc 
Gly Tyr Cys 
170 

etc aac act 
Leu Asn Thr 



gag ggc aac 
Glu Gly Asn 



acg gee tgc 
Thr Ala Cys 
220 

tac aaa age 
Tyr Lys Ser 
235 

ttc acc ate 
Phe Thr lie 
250 

aac ctt gtg 
Asn Leu Val 



ccc age gec 
Pro Ser Ala 



tea gee tac 
Ser Ala Tyr 
300 

atg gtg etc 
Met Val Leu 
315 

tgg etc gac 
Trp Leu Asp 
330 

cca tec aac 
Pro Ser Asn 



aac ate cgc 



gga gag aac 
Gly Glu Asn 



gee aac cag 
Ala Asn Gin 



gat get cag 
Asp Ala Gin 
175 

age cac cag 
Ser His Gin 
190 

teg agg gcg 
Ser Arg Ala 
205 

gac tct gee 
Asp Ser Ala 



tac tac ggc 
Tyr Tyr Gly 



ate acc cag 
lie Thr Gin 
255 

age ate acc 
Ser lie Thr 
270 

cag ccc ggc 
Gin Pro Gly 
285 

ggc ggc etc 
Gly Gly Leu 



gtg ttc age 
Val Phe Ser 



age ggc aac 
Ser Gly Asn 
335 

acc ctg gee 
Thr Leu Ala 
350 

tgg gga gac 



ggc 432 
Gly 



tat 480 

Tyr 

160 

tgc 528 
Cys 



ggc 576 
Gly 

aat 624 
Asn 



ggt 672 
Gly 



ccc 720 

Pro 

240 

ttc 768 
Phe 



cgc 816 
Arg 



ggc 864 
Gly 



gee 912 
Ala 



att 960 

He 

320 

gee 1008 
Ala 



aac 1056 
Asn 



att 1104 
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Asn Pro Asn Thr His Val Val Phe Ser Asn He Arg Trp Gly Asp He 
355 360 365 

ggg tct act acg aac teg act gcg ccc ccg ccc ccg cct gcg tec age 1152 
Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 



acg acg ttt teg act aca egg agg age teg acg act teg age age ccg 
Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
385 390 395 400 



ggg tgc aag acg tgc acg teg ggc act acg tgc cag tat age aac gac 
Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

420 425 430 

tac tac teg caa tgc ctt tag 
Tyr Tyr Ser Gin Cys Leu 
435 



<210> 84 

<211> 438 

<212> PRT 

<213> Trichoderma reesei 

<400> 84 

Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
1 5 10 15 



Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 



Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 40 45 



Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
50 55 60 



Ala Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala 
65 70 75 80 



Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 95 



Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 HO 



1200 



age tgc acg cag act cac tgg ggg cag tgc ggt ggc att ggg tac age 1248 
Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 



1296 



1317 
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Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
115 120 ~ 125 

5 

Leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 

10 

Ser Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 160 

15 Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 170 175 

Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 
20 180 185 190 

Phe Cys Cys Asn Glu Met Asp He Leu Glu Gly Asn Ser Arg Ala Asn 
195 200 205 

25 

Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 220 

30 

Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 235 240 

35 Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr He He Thr Gin Phe 

245 250 255 

Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser He Thr Arg 
40 260 265 r 270 

Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin Pro Gly Gly 
275 280 285 

45 

Asp Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 

50 

Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser He 
305 310 315 320 

>5 Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 " 335 
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Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr -Leu Ala Asn 

340 345 350 



Asn Pro Asn Thr His Val Val Phe Ser Asn He Arg Trp Gly Asp He 
355 360 365 



Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 



Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
385 390 395 400 



Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 



Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

'420 425 430 



Tyr Tyr Ser Gin Cys Leu 
435 



<210> 85 

<211> 954 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 6GP1 



<220> 

<221> CDS 

<222> (1) . . (954) 

<223> 6GP1 

<400> 85 

atg ggc gtg gac ccg ttc gag cgc aac aag ate etc ggc cgc ggc ate 

Met Gly Val Asp Pro Phe Glu Arg Asn Lys He Leu Gly Arg Gly He 
15 10 15 

aac ate ggc aac gee ctg gag gee ccg aac gag ggc gac tgg ggc gtg 
Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 



48 



96 



gtg ate aag gac gag ttc ttc gac ate ate aag gag gee ggc ttc tec 144 
Val He Lys Asp Glu Phe Phe Asp He He Lys Glu Ala Gly Phe Ser 
35 40 45 



cac gtg cgc ate ccg ate cgc tgg tec ace cac gee tac gee ttc ccg 



192 
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His Val Arg He Pro He Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
50 55 60 

ccg tac aag ate atg gac cgc ttc ttc aag cgc gtg gac gag gtg ate 240 

5 Pro Tyr Lys He Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val He 
65 70 75 80 

aac ggc gec etc aag cgc ggc etc gee gtg gee ate aac ate cac cac 288 

Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Ala He Asn He His His 

10 85 90 95 

tac gag gag etc atg aac gac ccg gag gag cac aag gag cgc ttc etc 336 

Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 HO 

15 

gee etc tgg aag cag ate gee gac cgc tac aag gac tac ccg gag ace 384 

Ala Leu Trp Lys Gin He Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 
115 120 125 

20 etc ttc ttc gag ate etc aac gag ccg cac ggc aac etc ace ccg gag 432 

Leu Phe Phe Glu He Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
130 135 140 

aag tgg aac gag ctg etc gag gag gec etc aag gtg ate cgc tec ate 480 

25 Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val He Arg Ser He 
145 150 155 160 

gac aag aag cac ace ate ate att ggc ace gca gag tgg gga ggc ate 528 

Asp Lys Lys His Thr He He He Gly Thr Ala Glu Trp Gly Gly He 

30 165 170 175 

tec gec etc gag aag etc tec gtg ccg aag tgg gag aag aat tec ate 576 

Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser He 

180 185 190 

gtg acc ate cac tac tac aac ccg ttc gag ttc acg cac cag ggc gee 624 

Val Thr He His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 
195 200 205 

40 gag tgg gtg gag ggc tec gag aag tgg ctt ggc cgc aag tgg ggc tec 672 

Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
210 215 220 

ccg gac gac cag aag cac etc ate gag gag ttc aac ttc ate gag gag 720 

45 Pro Asp Asp Gin Lys His Leu He Glu Glu Phe Asn Phe He Glu Glu 
225 230 235 240 

tgg tec aag aag aac aag cgc ccg ate tac ate ggc gag ttt ggc gee 768 

Trp Ser Lys Lys Asn Lys Arg Pro He Tyr He Gly Glu Phe Gly Ala 

50 245 250 255 

tac cgc aag gee gac etc gag tec cgc ate aag tgg acc tec ttc gtg 816 

Tyr Arg Lys Ala Asp Leu Glu Ser Arg He Lys Trp Thr Ser Phe Val 

260 265 270 

55 

gtg cgt gag atg gag aag cgc cgc tgg tec tgg gee tac tgg gag ttc 864 

Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
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25 



40 



45 



275 280 285 

tgc tec ggc ttc ggc gtg tac gac acc etc cgc aag acc tgg aac aag 
Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 

3 00 



5 


290 


295 




gac etc 


etc gag gee etc ate 




Asp Leu 


Leu Glu Ala Leu lie 




3 05 




10 








<210> 


86 




<211> 


317 




<212> 


PRT 


15 


<213> 


Artificial Sequence 




<220> 






<223> 


Synthetic Construct 


20 


<400> 


86 



315 



Met Gly Val Asp Pro Phe Glu Arg Asn Lys lie Leu Gly Arg Gly lie 
1*5 10 15 



Asn lie Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

2 0 2 5 3 0 



30 Val lie Lys Asp Glu Phe Phe Asp lie lie Lys Glu Ala Gly Phe Ser 

35 40 45 



His Val Arg lie Pro lie Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
35 50 55 60 

Pro Tyr Lys lie Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val lie 

65 70 ~ 75 80 



Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Ala lie Asn lie His His 

85 90 95 



Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 110 



50 Ala Leu Trp Lys Gin lie Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 

115 120 125 



Leu Phe Phe Glu lie Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
55 130 135 140 



912 



954 
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Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val lie Arg Ser lie 
145 150 155 160 

5 Asp Lys Lys His Thr lie lie He Gly Thr Ala Glu Trp Gly Gly He 

165 170 175 

Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser He 
10 180 185 190 

Val Thr He His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 
195 200 205 

15 

Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
210 215 220 

20 

Pro Asp Asp Gin Lys His Leu He Glu Glu Phe Asn Phe He Glu Glu 
225 230 235 240 

25 Trp Ser Lys Lys Asn Lys Arg Pro He Tyr He Gly Glu Phe Gly Ala 

245 250 255 



Tyr Arg Lys Ala Asp Leu Glu Ser Arg He Lys Trp Thr Ser Phe Val 
30 260 265 270 

Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
275 280 285 

35 

Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
290 295 300 

40 

Asp Leu Leu Glu Ala Leu He Gly Gly Asp Ser lie Glu 
305 310 315 



45 <210> 87 

<211> 1248 

<212> DNA 

<213> Hordeum vulgare 

50 

<220> 

<221> CDS 

<222> (1) . . (1248) 

<223> Barley Amyl amylase 

55 

<400> 87 

atg gca cac caa gtc etc ttt cag ggg ttc aac tgg gag teg tgg aag 



48 
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Met Ala His 
1 

cag age ggc 
Gin Ser Gly 



gec get gec 
Ala Ala Ala 
35 

gtc tec aac 
Val Ser Asn 
50 

tec aag tac 
Ser Lys Tyr 
65 

cac ggc aag 
His Gly Lys 



tgc gee gac 
Cys Ala Asp 



ggc ace tec 
Gly Thr Ser 
115 

gac gac ace 
Asp Asp Thr 
130 

gac ttc gec 
Asp Phe Ala 
145 

cgc gag etc 
Arg Glu Leu 



gac gcg tgg 
Asp Ala Trp 



aag gtg tac 
Lys Val Tyr 
195 

tgg gac aat 
Trp Asp Asn 
210 

gac gcg cac 
Asp Ala His 



Gin Val Leu 
5 

ggg tgg tac 
Gly Trp Tyr 
20 

gga gtc ace 
Gly Val Thr 



gaa ggt tac 
Glu Gly Tyr 



ggc aac gcg 
Gly Asn Ala 
70 

ggc gtg cag 
Gly Val Gin 
85 

tac aag gat 
Tyr Lys Asp 
100 

gac ggc cgc 
Asp Gly Arg 



aaa tac tec 
Lys Tyr Ser 



gee gcg ccc 
Ala Ala Pro 
150 

aag gag tgg 
Lys Glu Trp 
165 

cgc ctt gac 
Arg Leu Asp 
180 

ate gac ggc 
lie Asp Gly 



atg gee acc 
Met Ala Thr 



egg cag aat 
Arg Gin Asn 



Phe Gin Gly 



aac atg atg 
Asn Met Met 
25 

cac gtc tgg 
His Val Trp 
40 

atg cct ggt 
Met Pro Gly 
55 

gcg gag etc 
Ala Glu Leu 



gee ate gec 
Ala lie Ala 



age cgc ggc 
Ser Arg Gly 
105 

etc gac tgg 
Leu Asp Trp 
120 

gat ggc acc 
Asp Gly Thr 
135 

gac ate gac 
Asp lie Asp 



etc etc tgg 
Leu Leu Trp 



ttc gee agg 
Phe Ala Arg 
185 

aca tec ccg 
Thr Ser Pro 
200 

ggc ggc gac 
Gly Gly Asp 
215 

ctg gtg aac 
Leu Val Asn 



Phe Asn Trp 
10 

atg ggc aag 
Met Gly Lys 



ctg cca ccg 
Leu Pro Pro 



egg ctg tac 
Arg Leu Tyr 
60 

aag teg etc 
Lys Ser Leu 
75 

gac ate gtc 
Asp He Val 
90 

ate tac tgc 
He Tyr Cys 



ggc ccc cac 
Gly Pro His 



gca aac etc 
Ala Asn Leu 
140 

cac etc aac 
His Leu Asn 
155 

etc aag age 
Leu Lys Ser 
170 

ggc tac teg 
Gly Tyr Ser 



age etc gee 
Ser Leu Ala 



ggc aag ccc 
Gly Lys Pro 
220 

tgg gtg gac 
Trp Val Asp 



Glu Ser Trp 

15 

gtc gac gac 
Val Asp Asp 
30 

ccg teg cac 
Pro Ser His 
45 

gac ate gac 
Asp He Asp 



ate ggc gcg 
He Gly Ala 



ate aac cac 
He Asn His 
95 

ate ttc gag 
He Phe Glu 
110 

atg ate tgt 
Met He Cys 
125 

gac acc gga 
Asp Thr Gly 



gac egg gtc 
Asp Arg Val 



gac etc ggc 
Asp Leu Gly 
175 

ccg gag atg 
Pro Glu Met 
190 

gtg gee gag 
Val Ala Glu 
205 

aac tac gac 
Asn Tyr Asp 



aag gtg ggc 
Lys Val Gly 



Lys 



ate 96 
He 



tec 144 
Ser 



gcg 192 
Ala 



etc 240 

Leu 

80 

cgc 288 
Arg 



ggc 3 36 

Gly 

cgc 384 
Arg 

gee 432 
Ala 



cag 480 

Gin 

160 

ttc 528 
Phe 



gee 576 
Ala 



gtg 624 
Val 



cag 672 
Gin 



ggc 720 

Gly 



266 
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225 230 235 240 

gcg gcc teg gca ggc atg gtg ttc gac ttc acg acc aaa ggg ata ctg 768 

Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 

245 250 255 

aac get gcc gtg gag ggc gag ctg tgg agg ctg ate gac ccg cag ggg 816 

Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu lie Asp Pro Gin Gly 

260 265 270 

aag gcc ccc ggc gtg atg gga tgg tgg ccg gcc aag gcc gtc acc ttc 864 

Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 

15 gtc gac aac cac gat aca ggc tec acg cag gcc atg tgg cca ttc ccc 912 

Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

tec gac aag gtc atg cag ggc tac gcg tac ate etc acc cac ccc ggc 960 

20 Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr lie Leu Thr His Pro Gly 

305 310 315 320 

ate cca tgc ate ttc tac gac cat ttc ttc aac tgg ggg ttt aag gac 1008 

lie Pro Cys lie Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

25 325 330 335 

cag ate gcg gcg ctg gtg gcg ate agg aag cgc aac ggc ate acg gcg 1056 

Gin He Ala Ala Leu Val Ala He Arg Lys Arg Asn Gly He Thr Ala 

340 345 350 

acg age get ctg aag ate etc atg cac gaa gga gat gcc tac gtc gcc 1104 

Thr Ser Ala Leu Lys He Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 

35 gag ata gac ggc aag gtg gtg gtg aag ate ggg tec agg tac gac gtc 1152 

Glu He Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
370 375 380 

ggg gcg gtg ate ccg gcc ggg ttc gtg acc teg gca cac ggc aac gac 1200 

40 Gly Ala Val He Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 

385 390 395 400 

tac gcc gtc tgg gag aag aac ggt gcc gcg gca aca eta caa egg age 1248 

Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

45 405 410 415 

<210> 88 
<211> 416 
50 <212> PRT 

<213> Hordeum vulgare 

<400> 88 

55 Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 

15 10 15 



30 
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Gin Ser Gly Gly Trp Tyr Asn Met Met Met Gly Lys Val Asp Asp lie 

20 25 30 

5 

Ala Ala Ala Gly Val Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
35 40 45 

10 Val Ser Asn Glu Gly Tyr Met Pro Gly Arg Leu Tyr Asp lie Asp Ala 
50 55 60 

Ser Lys Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu lie Gly Ala Leu 
15 65 70 75 80 

His Gly Lys Gly Val Gin Ala lie Ala Asp lie Val lie Asn His Arg 

85 90 95 

20 

Cys Ala Asp Tyr Lys Asp Ser Arg Gly lie Tyr Cys lie Phe Glu Gly 

100 105 110 

25 

Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met lie Cys Arg 
115 120 125 

30 Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 
130 135 140 

Asp Phe Ala Ala Ala Pro Asp He Asp His Leu Asn Asp Arg Val Gin 
35 145 150 155 160 

Arg Glu Leu Lys Glu Trp Leu Leu Trp Leu Lys Ser Asp Leu Gly Phe 

165 170 175 

40 

Asp Ala Trp Arg Leu Asp Phe Ala Arg Gly Tyr Ser Pro Glu Met Ala 

180 185 190 

45 

Lys Val Tyr He Asp Gly Thr Ser Pro Ser Leu Ala Val Ala Glu Val 
195 200 205 

50 Trp Asp Asn Met Ala Thr Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin 
210 215 220 

Asp Ala His Arg Gin Asn Leu Val Asn Trp Val Asp Lys Val Gly Gly 
55 225 230 235 240 
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Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 

245 250 255 



Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu lie Asp Pro Gin Gly 

260 265 270 



Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 



Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 



Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr lie Leu Thr His Pro Gly 
305 310 315 320 



lie Pro Cys lie Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

325 330 335 



Gin lie Ala Ala Leu Val Ala lie Arg Lys Arg Asn Gly lie Thr Ala 

340 345 350 



Thr Ser Ala Leu Lys lie Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 



Glu lie Asp Gly Lys Val Val Val Lys lie Gly Ser Arg Tyr Asp Val 
370 375 380 



Gly Ala Val He Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 
385 390 395 400 



Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

405 410 415 



<210> 89 

<211> 1401 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei S-Glucosidase 2 



<220> 

<221> CDS 

<222> (1) . . (1401) 

<223> Trichoderma reesei S-Glucosidase 2 
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10 



<400> 89 

atg ttg ccc aag gac ttt cag tgg ggg ttc gcc acg get gec tac cag 4 8 

Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 

ate gag ggc gcc gtc gac cag gac ggc cgc ggc ccc age ate tgg gac 96 

He Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser He Trp Asp 

20 25 30 

acg ttc tgc gcg cag ccc ggc aag ate gcc gac ggc teg teg ggc gtg 144 

Thr Phe Cys Ala Gin Pro Gly Lys He Ala Asp Gly Ser Ser Gly Val 
35 40 45 

15 acg gcg tgc gac teg tac aac cgc acg gcc gag gac att gcg ctg ctg 192 

Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp He Ala Leu Leu 
50 55 60 

aag teg etc ggg gcc aag age tac cgc ttc tec ate teg tgg teg cgc 240 

20 Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser He Ser Trp Ser Arg 
65 70 75 80 

ate ate ccc gag ggc ggc cgc ggc gat gcc gtc aac cag gcg ggc ate 288 

He He Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly He 

25 85 90 95 

gac cac tac gtc aag ttc gtc gac gac ctg etc gac gcc ggc ate acg 336 

Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly He Thr 

100 105 110 

ccc ttc ate acc etc ttc cac tgg gac ctg ccc gag ggc ctg cat cag 384 

Pro Phe He Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 

35 egg tac ggg ggg ctg ctg aac cgc acc gag ttc ccg etc gac ttt gaa 432 

Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 

aac tac gcc cgc gtc atg ttc agg gcg ctg ccc aag gtg cgc aac tgg 480 

40 Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
145 150 155 160 

ate acc ttc aac gag ccg ctg tgc teg gcc ate ccg ggc tac ggc tec 528 

He Thr Phe Asn Glu Pro Leu Cys Ser Ala He Pro Gly Tyr Gly Ser 

45 165 170 175 

ggc acc ttc gcc ccc ggc egg cag age acc teg gag ccg tgg acc gtc 576 

Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 

50 

ggc cac aac ate etc gtc gcc cac ggc cgc gcc gtc aag gcg tac cgc 624 

Gly His Asn He Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 

55 gac gac ttc aag ccc gcc age ggc gac ggc cag ate ggc ate gtc etc 672 

Asp Asp Phe Lys Pro Ala Ser Gly Asp Gly Gin He Gly He Val Leu 
210 J 215 220 



30 



270 



WO 2005/096804 



PCT/US2004/007182 



aac ggc gac 
Asn Gly Asp 
225 

gag gcg gcc 
Glu Ala Ala 



ccc ate tac 
Pro lie Tyr 



gac egg ctg 
Asp Arg Leu 
275 

tec aac gac 
Ser Asn Asp 
290 

cac cgc age 
His Arg Ser 
305 

gtg etc ttc 
Val Leu Phe 



tec ccc tgg 
Ser Pro Trp 



tgg ate age 
Trp lie Ser 
355 

ggc acg age 
Gly Thr Ser 
370 

gaa gat gac 
Glu Asp Asp 
385 

gtt ace gcc 
Val Thr Ala 



tgg teg etc 
Trp Ser Leu 



ttt ggg gtt 
Phe Gly Val 
435 



ttc ace tac 
Phe Thr Tyr 
230 

gag egg cgc 
Glu Arg Arg 
245 

ttg ggc gac 
Leu Gly Asp 
260 

ccg ace ttt 
Pro Thr Phe 



ttt tac ggc 
Phe Tyr Gly 



teg ccc gcc 
Ser Pro Ala 
310 

ace aac aag 
Thr Asn Lys 
325 

ctg cgc ccc 
Leu Arg Pro 
340 

aag agg tac 

Lys Arg Tyr 



ate aag ggc 
lie Lys Gly 



ttc agg gtc 
Phe Arg Val 
390 

gtg gag ctg 
Val Glu Leu 
405 

atg gac aac 
Met Asp Asn 
420 

acg tat gtg 
Thr Tyr Val 



ccc tgg gac 
Pro Trp Asp 



etc gag ttc 
Leu Glu Phe 



tac ccg gcg 
Tyr Pro Ala 

265 

acg ccc gag 
Thr Pro Glu 
280 

atg aac cac 
Met Asn His 
295 

tec gcc gac 

Ser Ala Asp 



cag ggc aac 
Gin Gly Asn 



tgt gcc gcc 
Cys Ala Ala 
345 

ggc tac ccg 
Gly Tyr Pro 
360 

gag age gac 
Glu Ser Asp 
375 

aag tac tat 
Lys Tyr Tyr 



gac ggg gtc 
Asp Gly Val 



ttt gag tgg 
Phe Glu Trp 
425 

gat tat gag 
Asp Tyr Glu 
440 



gcc gcc gac 
Ala Ala Asp 
235 

ttc acg gcc 
Phe Thr Ala 
250 

teg atg cgc 
Ser Met Arg 



gag cgc gcc 
Glu Arg Ala 



tac acg tec 
Tyr Thr Ser 
300 

gac ace gtc 
Asp Thr Val 
315 

tgc ate ggc 
Cys lie Gly 
330 

ggc ttc cgc 
Gly Phe Arg 



ccc ate tac 
Pro lie Tyr 



ttg ccc aag 
Leu Pro Lys 
380 

aac gag tac 
Asn Glu Tyr 
395 

aac gtc aag 
Asn Val Lys 
410 

gcg gac ggc 
Ala Asp Gly 



aat ggg cag 
Asn Gly Gin 



ccg gcc gac 
Pro Ala Asp 



tgg ttc gcg 
Trp Phe Ala 
255 

aag cag ctg 
Lys Gin Leu 

270 

etc gtc cac 
Leu Val His 
285 

aac tac ate 
Asn Tyr lie 



ggc aac gtc 
Gly Asn Val 



ccc gag acg 
Pro Glu Thr 
335 

gac ttc ctg 
Asp Phe Leu 
350 

gtg acg gag 
Val Thr Glu 
365 

gag aag att 
Glu Lys lie 



ate cgt gcc 
lie Arg Ala 



ggg tac ttt 
Gly Tyr Phe 
415 

tac gtg acg 
Tyr Val Thr 
430 

aag egg ttc 
Lys Arg Phe 
445 



aag 72 0 

Lys 

240 

gac 768 
Asp 



ggc 816 

Gly 



ggc 864 

Gly 

cgc 912 
Arg 

gac 960 

Asp 

320 

cag 1008 
Gin 



gtg 1056 
Val 



aac 1104 
Asn 



etc 1152 
Leu 



atg 1200 

Met 

400 

gcc 1248 
Ala 



agg 1296 
Arg 



ccc 1344 
Pro 
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aag aag age gca aag age ttg aag ccg ctg ttt gac gag ctg att gcg 13 92 

Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu lie Ala 
450 455 460 



gcg gcg tga 
Ala Ala 



465 




<210> 


90 


<211> 


466 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


90 



Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 



lie Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser lie Trp Asp 

20 25 30 



Thr Phe Cys Ala Gin Pro Gly Lys lie Ala Asp Gly Ser Ser Gly Val 
35 40 45 



Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp lie Ala Leu Leu 
50 55 60 



Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser lie Ser Trp Ser Arg 
65 70 75 80 



lie lie Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly lie 

85 90 95 



Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly lie Thr 

100 105 110 



Pro Phe lie Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 



Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 



Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
145 150 155 160 



1401 
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lie Thr Phe Asn 



Gly Thr Phe Ala 

180 



Gly His Asn lie 
195 



Asp Asp Phe Lys 
210 



Asn Gly Asp Phe 
225 



Glu Ala Ala Glu 



Pro lie Tyr Leu 

260 



Asp Arg Leu Pro 
275 



Ser Asn Asp Phe 
290 



His Arg Ser Ser 
305 



Val Leu Phe Thr 



Ser Pro Trp Leu 

340 



Trp lie Ser Lys 
3 55 



Gly Thr Ser lie 
370 



Glu Pro Leu Cys 
165 



Pro Gly Arg Gin 



Leu Val Ala His 

200 



Pro Ala Ser Gly 
215 



Thr Tyr Pro Trp 
230 



Arg Arg Leu Glu 
245 



Gly Asp Tyr Pro 



Thr Phe Thr Pro 

280 



Tyr Gly Met Asn 
295 



Pro Ala Ser Ala 
310 



Asn Lys Gin Gly 
325 



Arg Pro Cys Ala 



Arg Tyr Gly Tyr 

360 



Lys Gly Glu Ser 
375 



Ser Ala lie Pro 
170 



Ser Thr Ser Glu 
185 



Gly Arg Ala Val 



Asp Gly Gin lie 

220 



Asp Ala Ala Asp 
235 



Phe Phe Thr Ala 
250 



Ala Ser Met Arg 
265 



Glu Glu Arg Ala 



His Tyr Thr Ser 

300 



Asp Asp Thr Val 
315 



Asn Cys lie Gly 
330 



Ala Gly Phe Arg 
345 



Pro Pro lie Tyr 



Asp Leu Pro Lys 

380 



273 



Gly Tyr Gly Ser 
175 



Pro Trp Thr Val 
190 



Lys Ala Tyr Arg 
205 



Gly lie Val Leu 



Pro Ala Asp Lys 

240 



Trp Phe Ala Asp 
255 



Lys Gin Leu Gly 
270 



Leu Val His Gly 
285 



Asn Tyr lie Arg 



Gly Asn Val Asp 

320 



Pro Glu Thr Gin 
335 



Asp Phe Leu Val 
350 



Val Thr Glu Asn 
365 



Glu Lys lie Leu 
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Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr lie Arg Ala Met 
385 ' ~ ^ 390 395 400 



Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 



Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 



Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
435 440 445 



Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu lie Ala 
450 455 460 



Ala Ala 
465 



<210> 91 

<211> 2103 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei fc-Glucosidase D 



<220> 

<221> CDS 

<222> (1) . . (2103) 

<223> Trichoderma reesei S-Glucosidase D 
<400> 91 

atg att etc ggc tgt gaa age aca ggt gtc ate tct gec gtc aaa cac 
Met He Leu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
1 5 10 15 

ttt gtc gec aac gac cag gag cac gag egg cga gcg gtc gac tgt etc 
Phe Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 

20 25 30 

ate acc cag egg get etc egg gag gtc tat ctg cga ccc ttc cag ate 
He Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin He 
35 40 45 

gta gec cga gat gca agg ccc ggc gca ttg atg aca tec tac aac aag 
Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

gtc aat ggc aag cac gtc get gac age gec gag ttc ctt cag ggc att 



48 



96 



144 



192 



240 
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Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly lie 
65 70 75 80 

etc egg act gag tgg aat tgg gac cct etc att gtc age gac tgg tac 288 
Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu lie Val Ser Asp Trp Tyr 

85 90 95 

ggc acc tac acc act att gat gec ate aaa gec ggc ctt gat etc gag 336 
Gly Thr Tyr Thr Thr lie Asp Ala lie Lys Ala Gly Leu Asp Leu Glu 

100 105 HO 

atg ccg ggc gtt tea cga tat cgc ggc aaa tac ate gag tct get ctg 384 
Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr lie Glu Ser Ala Leu 
115 120 125 

cag gec cgt ttg ctg aag cag tec act ate gat gag cgc get cgc cgc 432 
Gin Ala Arg Leu Leu Lys Gin Ser Thr lie Asp Glu Arg Ala Arg Arg 
130 135 140 

gtg etc agg ttc gec cag aag gec age cat etc aag gtc tec gag gta 480 
Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
145 150 155 160 

gag caa ggc cgt gac ttc cca gag gat cgc gtc etc aac cgt cag ate 528 
Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin lie 

165 170 175 

tgc ggc age age att gtc eta ctg aag aat gag aac tec ate tta cct 576 
Cys Gly Ser Ser lie Val Leu Leu Lys Asn Glu Asn Ser lie Leu Pro 

180 185 190 

etc ccc aag tec gtc aag aag gtc gec ctt gtt ggt tec cac gtg cgt 624 
Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
195 200 205 

eta ccg get ate teg gga gga ggc age gec tct ctt gtc cct tac tat 672 
Leu Pro Ala lie Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 
210 215 220 

gec ata tct eta tac gat gee gtc tct gag gta eta gec ggt gec acg 720 
Ala lie Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 
225 230 235 240 

ate acg cac gag gtc ggt gee tat gec cac caa atg ctg ccc gtc ate 768 
lie Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val lie 

245 250 255 

gac gca atg ate age aac gec gta ate cac ttc tac aac gac ccc ate 816 
Asp Ala Met lie Ser Asn Ala Val lie His Phe Tyr Asn Asp Pro He 

260 265 270 

gat gtc aaa gac aga aag etc ctt ggc agt gag aac gta teg teg aca 864 
Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 

teg ttc cag etc atg gat tac aac aac ate cca acg etc aac aag gec 912 
Ser Phe Gin Leu Met Asp Tyr Asn Asn He Pro Thr Leu Asn Lys Ala 
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290 

atg ttc tgg 
Met Phe Trp 
305 

att tgg gaa 
lie Trp Glu 



gat aat gag 
Asp Asn Glu 



gcc ttt ttc 
Ala Phe Phe 
355 

gtg gcc ggc 
Val Ala Gly 
370 

acg acc aag 
Thr Thr Lys 
385 

gta cac ctg 
Val His Leu 



egg gcc gtc 
Arg Ala Val 



gga etc age 
Gly Leu Ser 
435 

gac ctg ccc 
Asp Leu Pro 
450 

get ccc aat 
Ala Pro Asn 
465 

age tgg get 
Ser Trp Ala 



aac gag aca 
Asn Glu Thr 



ccg teg ggg 
Pro Ser Gly 
515 



ggt act etc 
Gly Thr Leu 
310 

ttt ggc etc 
Phe Gly Leu 
325 

etc gtg att 
Leu Val lie 
340 

gga aag gga 
Gly Lys Gly 



age acc tac 
Ser Thr Tyr 



atg gag acg 
Met Glu Thr 
390 

ggt gcc tgt 
Gly Ala Cys 
405 

aag gcc gca 
Lys Ala Ala 
420 

ggc gag tgg 
Gly Glu Trp 



cct ggt gtg 
Pro Gly Val 



get gta gtc 
Ala Val Val 
470 

cat aaa gca 
His Lys Ala 
485 

ggc cac gga 
Gly His Gly 
500 

aaa etc tec 
Lys Leu Ser 



295 

gtg ggc gag 
Val Gly Glu 



agt gtc ttt 
Ser Val Phe 



gaa aat aca 
Glu Asn Thr 
345 

acg acg gaa 
Thr Thr Glu 
360 

aag ctg cgt 
Lys Leu Arg 
375 

acc ggt gtt 
Thr Gly Val 



etc aag gtc 
Leu Lys Val 



gcc gat gcc 
Ala Asp Ala 
425 

gag tct gag 
Glu Ser Glu 
440 

gac acc atg 
Asp Thr Met 
455 

gtc aac cag 
Val Asn Gin 



aag gcc att 
Lys Ala lie 



ate tec gat 
He Ser Asp 
505 

eta teg tgg 
Leu Ser Trp 
520 



300 

ttt ate cct 
Phe lie Pro 
315 

ggc act gcc 
Gly Thr Ala 
330 

aca cat cag 
Thr His Gin 



aaa gtc get 
Lys Val Ala 



etc gag ttt 
Leu Glu Phe 
380 

gtc aac ttt 
Val Asn Phe 
395 

gac cca cag 
Asp Pro Gin 
410 

gac tac acc 
Asp Tyr Thr 



ggt ttt gac 
Gly Phe Asp 



ate teg caa 
He Ser Gin 
460 

tea ggc acc 
Ser Gly Thr 
475 

gtg cag get 
Val Gin Ala 
490 

gtg etc ttt 
Val Leu Phe 



cca gtc gat 
Pro Val Asp 



acc gcc acg 
Thr Ala Thr 



gac ctt tat 
Asp Leu Tyr 
335 

acg cgt gga 
Thr Arg Gly 
350 

acc agg agg 
Thr Arg Arg 
365 

ggg tct gcc 
Gly Ser Ala 



ggc ggc ggt 
Gly Gly Gly 



gag atg att 
Glu Met He 
415 

ate ate tgc 
He He Cys 
430 

egg cct cac 
Arg Pro His 
445 

gtt ctt gac 
Val Leu Asp 



cca gtg aca 
Pro Val Thr 



tgg tat ggt 
Trp Tyr Gly 
495 

ggc aac gtc 
Gly Asn Val 
510 

gtg aag cac 
Val Lys His 
525 



gga 960 

Gly 

320 

att 1008 
He 



acc 1056 
Thr 



atg 1104 
Met 



aac 1152 
Asn 



gcc 1200 

Ala 

400 

gcg 1248 
Ala 



acg 1296 
Thr 



atg 1344 
Met 



gcc 1392 
Ala 



atg 1440 

Met 

480 

ggt 1488 
Gly 

aac 1536 
Asn 



aac 1584 
Asn 
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cca gca tat etc aac tac gec age gtt ggt gga egg gtc ttg tat ggc 1632 
Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 
530 535 540 



gag gat gtt tac gtt ggc tac aag ttc tac gac aaa acg gag agg gag 
Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 550 555 560 



1680 



gtt ctg ttt cct ttt ggg cat ggc ctg tct tac get acc ttc aag etc 1728 
Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

cca gat tct acc gtg agg acg gtc ccc gaa acc ttc cac ccg gac cag 1776 
Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 



1824 



ccc aca gta gec att gtc aag ate aag aac acg age agt gtc ccg ggc 

Pro Thr Val Ala He Val Lys He Lys Asn Thr Ser Ser Val Pro Gly 

595 600 605 

gee cag gtc ctg cag tta tac att teg gec cca aac teg cct aca cat 1872 

Ala Gin Val Leu Gin Leu Tyr He Ser Ala Pro Asn Ser Pro Thr His 

610 615 620 

cgc ccg gtc aag gag ctg cac gga ttc gaa aag gtg tat ctt gaa get 1920 

Arg Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 
625 630 635 640 

ggc gag gag aag gag gta caa ata ccc att gac cag tac get act age 1968 

Gly Glu Glu Lys Glu Val Gin He Pro He Asp Gin Tyr Ala Thr Ser 

645 650 655 

ttc tgg gac gag att gag age atg tgg aag age gag agg ggc att tat 2 016 

Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly He Tyr 

660 665 670 

gat gtg ctt gta gga ttc teg agt cag gaa ate teg ggc aag ggg aag 2064 

Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 

675 680 685 

ctg att gtg cct gaa acg cga ttc tgg atg ggg ctg tag 2103 

Leu He Val Pro Glu Thr Arg Phe Trp Met Gly Leu 

690 695 700 



<210> 92 

<211> 700 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 92 

Met He Leu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
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10 15 



Phe Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 
5 20 25 30 

lie Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin lie 
35 40 45 

10 

Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

15 

Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly lie 
65 70 75 80 



20 Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu He Val Ser Asp Trp Tyr 

85 90 95 



Gly Thr Tyr Thr Thr He Asp Ala He Lys Ala Gly Leu Asp Leu Glu 
25 100 105 110 

Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr He Glu Ser Ala Leu 
115 120 125 

30 

Gin Ala Arg Leu Leu Lys Gin Ser Thr He Asp Glu Arg Ala Arg Arg 
130 135 140 

35 

Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
145 150 155 160 

40 Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin He 

165 170 175 



Cys Gly Ser Ser He Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 
45 180 185 190 

Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
195 200 205 

50 

Leu Pro Ala He Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 
210 215 220 

55 

Ala He Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 
225 230 235 240 
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lie Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val lie 

245 250 255 

5 

Asp Ala Met lie Ser Asn Ala Val lie His Phe Tyr Asn Asp Pro lie 

260 265 270 

10 

Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 

15 Ser Phe Gin Leu Met Asp Tyr Asn Asn lie Pro Thr Leu Asn Lys Ala 
290 295 300 

Met Phe Trp Gly Thr Leu Val Gly Glu Phe lie Pro Thr Ala Thr Gly 
20 305 310 315 320 

lie Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr lie 

325 330 335 

25 

Asp Asn Glu Leu Val lie Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

30 

Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 360 365 



35 Val Ala Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 
370 375 380 

Thr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 
40 385 390 395 400 

Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met lie Ala 

405 410 415 

45 

Arg Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 425 430 

50 

Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 
435 440 445 

55 Asp Leu Pro Pro Gly Val Asp Thr Met He Ser Gin Val Leu Asp Ala 
450 455 460 
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Ala Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 
465 470 475 480 

5 

Ser Trp Ala His Lys Ala Lys Ala He Val Gin Ala Trp Tyr Gly Gly 

485 490 495 

10 Asn Glu Thr Gly His Gly He Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 

Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 
15 515 520 525 

Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 
530 535 540 

20 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 ~ 550 555 560 

25 

Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

30 Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 

Pro Thr Val Ala He Val Lys He Lys Asn Thr Ser Ser Val Pro Gly 
35 595 600 605 

Ala Gin Val Leu Gin Leu Tyr He Ser Ala Pro Asn Ser Pro Thr His 
610 615 620 

40 

Arg Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 
625 630 635 640 

45 

Gly Glu Glu Lys Glu Val Gin He Pro He Asp Gin Tyr Ala Thr Ser 

645 650 655 



50 Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly He Tyr 

660 665 670 

Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 
55 675 680 685 
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Leu lie Val Pro Glu Thr Arg Phe Trp Met Gly Leu 



<210> 93 
<211> 1496 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Maize optimized CBHI 
<400> 93 

tgcagtccgc ctgcaccctc cagtccgaga cccacccgcc gctcacctgg cagaagtgct 60 

cctccggcgg cacctgcacc cagcagaccg gctccgtggt gatcgacgcc aactggcgct 120 

ggacccacgc caccaactcc tccaccaact gctacgacgg caacacctgg tcctccaccc 180 

tctgcccgga caacgagacc tgcgccaaga actgctgcct cgacggcgcc gcctacgcct 240 

ccacctacgg cgtgaccacc tccggcaact ccctctccat cggcttcgtg acccagtccg 300 

cccagaagaa cgtgggcgcc cgcctctacc tcatggcctc cgacaccacc taccaggagt 360 

tcaccctcct cggcaacgag ttctccttcg acgtggacgt gtcccagctc ccgtgcggcc 420 

tcaacggcgc cctctacttc gtgtccatgg acgccgacgg cggcgtgtcc aagtacccga 480 

ccaacaccgc cggcgccaag tacggcaccg gctactgcga ctcccagtgc ccgcgcgacc 540 

tcaagttcat caacggccag gccaacgtgg agggctggga gccgtcctcc aacaacgcca 600 

acaccggcat cggcggccac ggctcctgct gctccgagat ggacatctgg gaggccaact 660 

ccatctccga ggccctcacc ccgcacccgt gcaccaccgt gggccaggag atctgcgagg 720 

gcgacggctg cggcggcacc tactccgaca accgctacgg cggcacctgc gacccggacg 780 

gctgcgactg gaacccgtac cgcctcggca acacctcctt ctacggcccg ggctcctcct 840 

tcaccctcga caccaccaag aagctcaccg tggtgaccca gttcgagacc tccggcgcca 900 

tcaaccgcta ctacgtgcag aacggcgtga ccttccagca gccgaacgcc gagctcggct 960 

cctactccgg caacgagctc aacgacgact actgcaccgc cgaggaggcc gagttcggcg 1020 

gctcctcctt ctccgacaag ggcggcctca cccagttcaa gaaggccacc tccggcggca 1080 

tggtgctcgt gatgtccctc tgggacgact actacgccaa catgctctgg ctcgactcca 1140 

cctacccgac caacgagacc tcctccaccc cgggcgccgt gcgcggctcc tgctccacct 1200 

cctccggcgt gccggcccag gtggagtccc agtccccgaa cgccaaggtg accttctcca 1260 

acatcaagtt cggcccgatc ggctccaccg gcaacccgtc cggcggcaac ccgccgggcg 1320 
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gcaacccgcc 


gggcaccacc 


accacccgcc 


gcccggccac 


caccaccggc 


tcctccccgg 


1380 


gcccgaccca 


gtcccactac 


ggccagtgcg 


gcggcatcgg 


ctactccggc 


ccgaccgtgt 


1440 


gcgcctccgg 


caccacctgc 


caggtgctca 


acccgtacta 


ctcccagtgc 


ctctag 


1496 


<210> 94 
<211> 1365 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Maize optimized CBHII 










<400> 94 
ataataccac 


tcqaqqaqcq 


ccaggcctgc 


tcctccgtgt 


ggggccagtg 


cggcggccag 


60 


aactaqtccq 


qcccqacctg 


ctgcgcctcc 


ggctccacct 


gcgtgtactc 


caacgactac 


120 


tactcccaqt 


qcctcccggg 


cgccgcctcc 


tcctcctcct 


ccacccgcgc 


cgcctccacc 


180 


acctcccqcq 


tqtccccgac 

-_r* - J 


cacc tcccgc 


tcctcct ccg 


ccaccccgcc 


gccgggctcc 


240 


accaccaccc 


gcgtgccgcc 


ggtgggctcc 


ggcaccgcca 


cctactccgg 


caacccgttc 


300 


q tqqqcqtqa 


CCCCqtqqqc 

/ — ' ■ — ' mm* 


caacgcctac 


tacgcctccg 


aggtgtcctc 


cctcgccatc 


360 


ccgtccctca 


ccqqcqccat 


ggccaccgcc 


gccgccgccg 


tggccaaggt 


gccgtccttc 


420 


atgtggctcg 


acaccctcga 


caagaccccg 


ctcatggagc 


agaccctcgc 


cgacatccgc 


480 


accgccaaca 


agaacggcgg 


caactacgcc 


ggccagttcg 


tggtgtacga 


cctcccggac 


540 


cgcgactgcg 


ccgccctcgc 


ctccaacggc 


gagtactcca 


tcgccgacgg 


cggcgtggcc 


600 


aagtacaaga 


^ 4— m W 

accacdLcga 


r* a f* f* t~ t~* /""* f~T (~* 


cagatcgtgg 


tggagtactc 


cgacatccgc 


660 


accctcctcg 


tgatcgagcc 


ggactccctc 


gccaacctcg 


tgaccaacct 


cggcaccccg 


720 


aagtgcgcca 


acgcccagtc 


cgcctacctc 


gagtgcatca 


actacgccgt 


gacccagctc 


780 


aacctcccga 


acgtggccat 


gtacctcgac 


gccggccacg 


ccggctggct 


cggctggccg 


840 


gccaaccagg 


acccggccgc 


ccagctcttc 


gccaacgtgt 


acaagaacgc 


ctcctccccg 


900 


cgcgccctcc 


gcggcctcgc 


caccaacgtg 


gccaactaca 


acggctggaa 


catcacctcc 


960 


ccgccgtcct 


acacccaggg 


caacgccgtg 


tacaacgaga 


agctctacat 


ccacgccatc 


1020 


ggcccgctcc 


tcgccaacca 


cggctggtcc 


aacgccttct 


tcatcaccga 


ccagggccgc 


1080 


tccggcaagc 


agccgaccgg 


ccagcagcag 


tggggcgact 


ggtgcaacgt 


gatcggcacc 


1140 


ggcttcggca 


tccgcccgtc 


cgccaacacc 


ggcgactccc 


tcctcgactc 


cttcgtgtgg 


1200 


gtgaagccgg 


gcggcgagtg 


cgacggcacc 


tccgactcct 


ccgccccgcg 


cttcgactcc 


1260 
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cactgcgccc tcccggacgc cctccagccg gccccgcagg ccggcgcctg gttccaggcc 1320 
tacttcgtgc agctcctcac caacgccaac ccgtccttcc tctag 1365 



5 



<210> 95 

<211> 1317 

<212> DNA 

10 <213> Artificial Sequence 

<220> 

<223> Maize optimized EGLI 



15 


<400> 95 
atgcagcagc 


cgggcacctc 


caccccggag 


gtgcacccga 


agctcaccac 


ctacaagtgc 




accaagtccg 


gcggctgcgt 


ggcccaggac 


acctccgtgg 


tgctcgactg 


gaactaccgc 


20 


tggatgcacg 


acgccaacta 


caactcctgc 


accgtgaacg 


gcggcgtgaa 


caccaccctc 




tgcccggacg 


aggccacctg 


cggcaagaac 


tgcttcatcg 


agggcgtgga 


ctacgccgcc 


25 


tccggcgtga 


ccacctccgg 


ctcc tccctc 


accatgaacc 


agtacatgcc 


gtcc tcctcc 


ggcggctact 


cctccgtgtc 


cccgcgcc tc 


tacctcctcg 


actccgacgg 


cgagtacgtg 




atgctcaagc 


tcaacggcca 


ggagctctcc 


ttcgacgtgg 


acctctccgc 


cctcccgtgc 


30 


ggcgagaacg 


gctccctcta 


cctctcccag 


atggacgaga 


acggcggcgc 


caaccagtac 




aacaccgccg 


gcgccaacta 


cggctccggc 


tactgcgacg 


cccagtgccc 


ggtgcagacc 


35 


tggcgcaacg 


gcaccctcaa 


cacctcccac 


cagggcttct 


gctgcaacga 


gatggacatc 


c tCQaQQQCa 


actcccgcgc 


caacgccctc 


accccgcact 


cctgcaccgc 


caccgcctgc 




gactccgccg 


gctgcggctt 


caacccgtac 


ggctccggct 


acaagtccta 


ctacggcccg 


40 


ggcgacaccg 


tggacacctc 


caagaccttc 


accatcatca 


cccagttcaa 


caccgacaac 




ggctccccgt 


ccggcaacct 


cgtgtccatc 


acccgcaagt 


accagcagaa 


cggcgtggac 


45 


atcccgtccg 


cccagccggg 


cggcgacacc 


atctcctcct 


gcccgtccgc 


ctccgcctac 


ggcggcctcg 


ccaccatggg 


caaggccctc 


tcctccggca 


tggtgctcgt 


gttctccatc 




tggaacgaca 


actcccagta 


catgaactgg 


ctcgactccg 


gcaacgccgg 


cccgtgctcc 


50 


tccaccgagg 


gcaacccgtc 


caacaccctc 


gccaacaacc 


cgaacaccca 


cgtggtgttc 




tccaacatcc 


gctggggcga 


catcggctcc 


accaccaact 


ccaccgcccc 


gccgccgccg 


55 


ccggcctcct 


ccaccacctt 


ctccaccacc 


cgccgctcct 


ccaccacctc 


ctcctccccg 


tcctgcaccc 


agacccactg 


gggccagtgc 


ggcggcatcg 


gctactccgg 


ctgcaagacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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<210> 96 

5 <211> 1401 

<212> DNA 

<213> Artificial Sequence 

<220> 

10 <223> Maize optimized BGLII 

<400> 96 





atgctcccga 


aggac ttcca 


gtggggcttc 


gccaccgccg 


cctaccagat 


cgagggcgcc 


60 


15 


gtggaccagg 


acggccgcgg 


cccgtccatc 


tgggacacct 


tctgcgccca 


geegggcaag 


120 




atcgccgacg 


gctcctccgg 


cgtgaccgcc 


tgcgactcct 


acaaccgcac 


cgccgaggac 


180 


20 


atcgccctcc 


tcaagtccct 


cggcgccaag 


tec taccget 


tctccatctc 


ctggtcccgc 


240 


atcatcccgg 


agggeggecg 


cggcgacgcc 


gtgaaccagg 


ccggcatcga 


ccactacgtg 


300 




aagttcgtgg 


acgacctcct 


cgacgccggc 


atcaccccgt 


tcatcaccct 


et tccactgg 


360 


25 


gacctcccgg 


agggcctcca 


ccagcgctac 


ggcggcctcc 


tcaaccgcac 


cgagttcccg 


420 




ctcgact teg 


agaactaege 


ccgcgtgatg 


t tccgcgccc 


tcccgaaggt 


gcgcaactgg 


480 


30 


atcaccttca 


acgagccgct 


ctgctccgcc 


atcccgggct 


acggctccgg 


caccttcgcc 


540 


ccgggccgcc 


agtccacctc 


cgagccgtgg 


accgtgggcc 


acaacatcct 


cgtggcccac 


600 




ggccgcgccg 


tgaaggecta 


ccgcgacgac 


ttcaagcegg 


cctccggcga 


cggccagatc 


660 


35 


ggcatcgtgc 


teaaeggega 


cttcacctac 


ccgtgggacg 


ccgccgaccc 


ggccgacaag 


720 




gaggccgccg 


agcgccgcct 


cgagttcttc 


accgcctggt 


tcgccgaccc 


gatctacctc 


780 


40 


ggcgactacc 


cggcctccat 


gcgcaagcag 


ctcggcgacc 


gcctcccgac 


cttcaccccg 


840 


gaggagegeg 


ccctcgtgca 


cggctccaac 


gacttctacg 


gcatgaacca 


ctacacctcc 


900 




aactacatcc 


gccaccgctc 


ctccccggcc 


tccgccgacg 


acaccgtggg 


caacgtggac 


960 


45 


gtgetcttea 


ccaacaagca 


gggcaactgc 


atcggcccgg 


agacccagtc 


cccgtggctc 


1020 




cgcccgtgcg 


ccgccggctt 


ccgcgacttc 


ctcgtgtgga 


tctccaagcg 


ctacggctac 


1080 


50 


ccgccgatct 


aegtgacega 


gaacggcacc 


tccatcaagg 


gegagtcega 


cctcccgaag 


1140 


gagaagatcc 


tcgaggacga 


cttccgcgtg 


aagtactaca 


acgagtacat 


ccgcgccatg 


1200 




gtgaccgccg 


tggagctcga 


eggegtgaac 


gtgaagggct 


acttcgcctg 


gtccctcatg 


1260 


55 


gacaacttcg 


agtgggccga 


eggctaegtg 


acccgcttcg 


gcgtgaccta 


cgtggactac 


1320 




gagaaeggee 


agaagegett 


cccgaagaag 


tccgccaagt 


ccctcaagcc 


gctcttcgac 


1380 
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gagctcatcg ccgccgccta g 1401 



5 <210> 97 

<211> 2103 

<212> DNA 

<213> Artificial Sequence 

10 <220> 

<223> Maize optimized CEL3D 

<400> 97 

atgatcctcg gctgcgagtc caccggcgtg atctccgccg tgaagcactt cgtggccaac 60 

15 

gaccaggagc acgagcgccg cgccgtggac tgcctcatca cccagcgcgc cctccgcgag 120 

gtgtacctcc gcccgttcca gatcgtggcc cgcgacgccc gcccgggcgc cctcatgacc 180 

20 tcctacaaca aggtgaacgg caagcacgtg gccgactccg ccgagttcct ccagggcatc 240 

ctccgcaccg agtggaactg ggacccgctc atcgtgtccg actggtacgg cacctacacc 300 

accatcgacg ccatcaaggc cggcctcgac ctcgagatgc cgggcgtgtc ccgctaccgc 360 

25 

ggcaagtaca tcgagtccgc cctccaggcc cgcctcctca agcagtccac catcgacgag 420 

cgcgcccgcc gcgtgctccg cttcgcccag aaggcctccc acctcaaggt gtccgaggtg 480 

30 gagcagggcc gcgacttccc ggaggaccgc gtgctcaacc gccagatctg cggctcctcc 54 0 

atcgtgctcc tcaagaacga gaactccatc ctcccgctcc cgaagtccgt gaagaaggtg 600 

gccctcgtgg gctcccacgt gcgcctcccg gccatctccg gcggcggctc cgcctccctc 660 

35 

gtgccgtact acgccatctc cctctacgac gccgtgtccg aggtgctcgc cggcgccacc 720 

atcacccacg aggtgggcgc ctacgcccac cagatgctcc cggtgatcga cgccatgatc 780 

40 tccaacgccg tgatccactt ctacaacgac ccgatcgacg tgaaggaccg caagctcctc 840 

ggctccgaga acgtgtcctc cacctccttc cagctcatgg actacaacaa catcccgacc 900 

ctcaacaagg ccatgttctg gggcaccctc gtgggcgagt tcatcccgac cgccaccggc 960 

45 

atctgggagt tcggcctctc cgtgttcggc accgccgacc tctacatcga caacgagctc 1020 

gtgatcgaga acaccaccca ccagacccgc ggcaccgcct tcttcggcaa gggcaccacc 1080 

50 gagaaggtgg ccacccgccg catggtggcc ggctccacct acaagctccg cctcgagttc 1140 

ggctccgcca acaccaccaa gatggagacc accggcgtgg tgaacttcgg cggcggcgcc 1200 

gtgcacctcg gcgcctgcct caaggtggac ccgcaggaga tgatcgcccg cgccgtgaag 1260 

55 

gccgccgccg acgccgacta caccatcatc tgcaccggcc tctccggcga gtgggagtcc 1320 
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gagggcttcg 


accgcccgca 


catggacctc 


ccgccgggcg 


tggacaccat 


gatctcccag 


1380 




gtgctcgacg 


ccgccccgaa 


cgccgtggtg 


gtgaaccagt 


ccggcacccc 


ggtgaccatg 


1440 


5 


tcctgggccc 


acaaggccaa 


ggccatcgtg 


caggcctggt 


acggcggcaa 


cgagaccggc 


1500 




cacggcatct 


ccgacgtgct 


cttcggcaac 


gtgaacccgt 


ccggcaagct 


ctccctctcc 


1560 


10 


tggccggtgg 


acgtgaagca 


caacccggcc 


tacctcaact 


acgcctccgt 


gggcggccgc 


1620 


gtgctctacg 


gcgaggacgt 


gtacgtgggc 


tacaagttct 


acgacaagac 


cgagcgcgag 


1680 




gtgctcttcc 


cgttcggcca 


cggcctctcc 


tacgccacct 


tcaagctccc 


ggactccacc 


1740 


15 


gtgcgcaccg 


tgccggagac 


ct tccacccg 


gaccagccga 


ccgtggccat 


cgtgaagatc 


1800 




aagaacacct 


cctccgtgcc 


gggcgcccag 


gtgctccagc 


tc tacatctc 


cgccccgaac 


1860 


20 


t ccccgaccc 


accgcccggt 


gaaggagctc 


cacggcttcg 


agaaggtgta 


cctcgaggcc 


1920 


ggcgaggaga 


aggaggtgca 


gatcccgatc 


gaccagtacg 


ccacctcctt 


ctgggacgag 


1980 




atcgagtcca 


tgtggaagt c 


cgagcgcggc 


at ctacgacg 


tgctcgtggg 


ct tctcctcc 


2040 


25 


caggagatct 
tag 


ccggcaaggg 


caagctca tc 


gtgccggaga 


cccgcttctg 


gatgggcctc 


2100 
2103 


30 


<210> 98 
<211> 420 
<212> DNA 
<213> Zea 


mays 












35 


<220> 

<223> Q protein promoter 










40 


<400> 98 
gggctggtaa 


attacttggg 


agcaatggta 


tgcaaatcct 


ttgcatgtac 


gcaaaactag 


60 


ctagttgtca 


caagttgtat 


atcgattcgt 


cgcgtttcaa 


caactcatgc 


aacattacaa 


120 




acaagtaaca 


caatattaca 


aagttagttt 


catacaaagc 


aagaaaagga 


caataatact 


180 


45 


tgacatgtaa 


agtgaagctt 


attatacttc 


ctaatccaac 


acaaaacaaa 


aaaaagttgc 


240 




acaaaggtcc 


aaaaatccac 


atcaaccatt 


aacctatacg 


taaagtgagt 


gatgagtcac 


300 


50 


attatccaac 


aaatgtttat 


caatgtggta 


tcatacaagc 


attgacatcc 


cataaatgca 


360 


agaaattgtg 


ccaacaaagc 


tataagtaac 


cctcatatgt 


atttgcactc 


atgcatcaca 


420 



<210> 99 

55 <211> 1188 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> synthetic ferulic acid esterase 



5 


<400> 99 
atggccgcct 


ccctcccgac 


catgccgccg 


tccggctacg 


accaggtgcg 


caaeggegtg 




ccgcgcggcc 


aggtggtgaa 


catctcctac 


ttctccaccg 


ccaccaactc 


cacccgcccg 


10 


gcccgcgtgt 


acctcccgcc 


gggctactcc 


aaggacaaga 


agtactccgt 


gctctacctc 




ctccacggca 


tcggcggctc 


cgagaacgac 


tggttcgagg 


gcggcggccg 


cgccaacgtg 


15 


atcgccgaca 


acctcatcgc 


cgagggcaag 


atcaagccgc 


teatcategt 


gaccccgaac 


accaacgccg 


ccggcccggg 


catcgccgac 


ggctacgaga 


act tcaccaa 


ggacctcctc 




aactccctca 


tcccgtacat 


cgagtccaac 


tactccgtgt 


acaccgaccg 


cgagcaccgc 


20 


gccatcgccg 


gcct ctctat 


gggcggcggc 


cagtcct tea 


acatcggcct 


caccaacctc 




gacaagttcg 


cctacatcgg 


cccgatctcc 


gccgccccga 


acacctaccc 


gaacgagege 


25 


ctct tcccgg 


acggcggcaa 


ggccgcccgc 


gagaagctca 


agctcct ct t 


catcgcctgc 


ggcaccaacg 


ac tccctcat 


cggcttcggc 


cagcgcgtgc 


acgagtactg 


cgtggccaac 




aacatcaacc 


acgtgtactg 


gctcatccag 


ggcggcggcc 


acgact tcaa 


cgtgtggaag 


30 


ccgggcctct 


ggaact tcct 


ccagatggcc 


gaegaggecg 


gcctcacccg 


cgacggcaac 




accccggtgc 


cgaccccgtc 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


35 


qacqqcatca 


actcctcctc 


catcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


atcggctaca 


tcacctccgg 


cgactacctc 


gtgtacaagt 


ccatcgactt 


cggcaacggc 




gccacctcct 


tcaaggccaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


40 


aacggcccga 


acggcaccct 


catcggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 




acctacgagg 


agcagacctg 


ctccatctcc 


aaggtgaccg 


gcatcaacga 


cctctacctc 




gtgttcaagg 


gcccggtgaa 


catcgactgg 


ttcaccttcg 


gcgtgtag 





45 

<210> 100 

<211> 395 

<212> PRT 

50 <213> artificial sequence 

<220> 

<223> synthetic ferulic acid esterase 

55 <400> 100 

Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1188 
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10 15 



Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser 
5 20 25 30 

Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 

10 

Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly He 
50 55 60 

15 

Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65 70 75 80 

20 He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro Leu He He 

85 90 95 



Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 
25 100 105 110 



Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu 
115 120 125 

30 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly 
130 135 140 

35 

Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 150 155 160 



40 Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro Asn Thr Tyr 

165 170 175 



Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 
45 180 185 190 



Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

50 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His 
210 215 220 

55 

Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 
225 ^ 230 235 240 
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Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 



Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 



Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie 
275 280 285 



Glu He He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He 
290 295 300 



Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 310 315 320 



Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 



Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser 

340 345 350 



Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 
355 360 365 



He Ser 


Lys Val Thr Gly He 


370 


375 


Pro Val 


Asn He Asp Trp Phe 


385 


390 


<210> 


101 


<211> 


1188 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


plasmid 13036 


<400> 


101 



380 



395 



60 



atggccgcct ccctcccgac catgccgccg tccggctacg accaggtgcg caacggcgtg 
ccgcgcggcc aggtggtgaa catctcctac ttctccaccg ccaccaactc cacccgcccg 120 
gcccgcgtgt acctcccgcc gggctactcc aaggacaaga agtactccgt gctctacctc 180 
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ctccacggca 


tcggcggctc 


cgagaacgac 


tggttcgagg 


gcggcggccg 


cgccaacgtg 


240 


atcgccgaca 


acctcatcgc 


cgagggcaag 


atcaagccgc 


tcatcatcgt 


gaccccgaac 


300 


accaacgccg 


ccggcccggg 


catcgccgac 


ggctacgaga 


acttcaccaa 


ggacctcctc 


360 


aactccctca 


tcccgtacat 


cgagtccaac 


tactccgtgt 


acaccgaccg 


cgagcaccgc 


420 


gccatcgccg 


gcctctctat 


gggcggcggc 


cagtccttca 


acatcggcct 


caccaacctc 


480 


gacaagttcg 


cctacatcgg 


cccgatctcc 


gccgccccga 


acacctaccc 


gaacgagcgc 


540 


ctcttcccgg 


acggcggcaa 


ggccgcccgc 


gagaagctca 


agctcctct t 


catcgcctgc 


600 


ggcaccaacg 


actccctcat 


cggcttcggc 


cagcgcgtgc 


acgagtactg 


cgtggccaac 


660 


aacatcaacc 


acgtgtactg 


gctcatccag 


ggcggcggcc 


acgact tcaa 


cgtgtggaag 


720 


ccgggcctct 


ggaacttcct 


ccagatggcc 


gacgaggccg 


gcctcacccg 


cgacggcaac 


780 


accccggtgc 


cgaccccgtc 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


840 


gacggcat ca 


actcctcctc 


catcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


atcggctaca 


tcacctccgg 


cgactacctc 


gtgtacaagt 


ccatcgact t 


cggcaacggc 


960 


gccacctcct 


tcaaggccaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


1020 


aacggcccga 


acggcaccct 


catcggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 


1080 


acctacgagg 


agcagacctg 


ctccatctcc 


aaggtgaccg 


gcatcaacga 


cctctacctc 


1140 


gtgttcaagg 


gcccggtgaa 


catcgactgg 


ttcaccttcg 


gcgtgtag 




1188 



<210> 102 
<211> 395 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13036 
<400> 102 

Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 
15 10 15 

Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser 

20 25 30 

Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 
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Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly lie 
50 55 60 

5 Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65 70 75 80 

lie Ala Asp Asn Leu lie Ala Glu Gly Lys lie Lys Pro Leu He He 
10 85 90 95 

Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 

100 105 110 

15 

Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu 
115 120 125 

20 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala lie Ala Gly 
130 135 140 

25 Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 150 155 160 

Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro Asn Thr Tyr 
30 165 170 175 

Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 

180 185 190 

35 

Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

40 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His 
210 215 220 

45 Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 
225 230 235 240 

Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 
50 245 250 255 

Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 

55 

Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser Ser Ser lie 
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275 280 285 

Glu lie He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He 
5 290 295 300 

Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 310 315 320 

10 

Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 

15 

Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser 

340 345 350 

20 Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 

355 360 365 

He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val Phe Lys Gly 
25 370 375 380 



30 



Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 
385 390 395 



<210> 103 

<211> 1245 

<212> DNA 

35 <213> artificial sequence 

<220> 

<223> plasmid 13038 

40 <400> 103 





atgagggtgt 


tgctcgttgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 




gccgcctccc 


tcccgaccat 


gccgccgtcc 


ggctacgacc 


aggtgcgcaa 


cggcgtgccg 


120 


45 


cgcggccagg 


tggtgaacat 


ctcctacttc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


180 




cgcgtgtacc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


240 


50 


cacggcatcg 


gcggctccga 


gaacgactgg 


ttcgagggcg 


gcggccgcgc 


caacgtgatc 


300 


gccgacaacc 


tcatcgccga 


gggcaagatc 


aagccgctca 


tcatcgtgac 


cccgaacacc 


360 




aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


420 


55 


tccctcatcc 


cgtacatcga 


gtccaactac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


480 




atcgccggcc 


tctctatggg 


cggcggccag 


tccttcaaca 


tcggcctcac 


caacctcgac 


540 
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day l. Ltytt u 




aafrh rrarr 

y CI l_ V^. l_ V_ V* y V» V* 


y V— - V— \ — cx c* w c* 


cctia.ccccia.3L 


ca aacacc tc 

^-J W4 ~*-J W *w 


600 

v \J \J 








aaactcaaac 


tcctcttcat 

"w W» Vp* \pj* w V** V* W 


cacctacaac 


660 


d^v_dd^_.ydi_. t_ 


\— ^ L- ^ c* l Lyy 


v.- i— i—i^yyv-v-dy 


^ y *-y ^ a ^-y 


Aatacfcacat 


aaccaacaac 

^■J w« w w* "w V-* 


720 


3 t* /-* aa aa p p ai t~*f\ 


ty ^auLyyuL. 


pat" p annnr 


cici c an c c a err 

yy**yy^^ a ^*y 


apt- \~ Pi C CS t~ 


at" era a aacca 
y ^yya a y^^y 


780 


ggcccccgga 


-3 /— • f- 4— p /~» t- p p*» aa 
aLLLLLLLta 


yd LyyuLya^ 


oj=>nnppnopp 


LLdLLtyLya 


panes ararr 

w y y w ex c\ \^ ci v_» 


84 0 


ccgycgccga 


ccccytcccc 


gaagccggcc 


-3 a r"» a p> #-» r* n a 


ut_ydyyc_<_yd 


yyav, Lcn-y 


-7 \J \J 


*"*t /T ^ a H f~* as a /"■» t~ 


t_.t_.L-(w-<~L.l_l_.dl— 




yy ^-y i-yttyt 


uyydyyy uyy 


v^wy\^yy^wt-\— 


960 

S \J \J 


ygCCd.CciL.L.d 


t« LLLyyuy a 


L LdL L LLy ty 


G v_ a a y L^^a 


frn^rt" t~ p n n 

LLyav- V— *w y y 


p3 a C CIC1 CI e* 
c* ci w y y \^ 


102 0 


acctccttca 


aggecaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1080 


ggcccgaacg 


gcaccctcat 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


tacgaggagc 


agacctgctc 


catctccaag 


gtgaceggea 


tcaacgacct 


ctacctcgtg 


1200 


ttcaagggcc 


egg tgaacat 


cgactggt tc 


accttcggcg 


tgtag 




1245 



<210> 104 
<211> 414 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13038 aa 
<400> 104 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 



Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 



Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 45 



Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 



His Gly lie Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 
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85 90 95 

Ala Asn Val He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro 
5 100 105 110 

Leu lie He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 

10 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu lie Pro 
130 135 140 

15 

Tyr lie Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 

20 He Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn lie Gly Leu 

165 170 175 



Thr Asn Leu Asp Lys Phe Ala Tyr lie Gly Pro lie Ser Ala Ala Pro 
25 180 185 190 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

30 

Arg Glu Lys Leu Lys Leu Leu Phe lie Ala Cys Gly Thr Asn Asp Ser 
210 215 220 

35 

Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 

40 He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 

245 250 255 



Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 
45 260 265 270 

Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 

50 

Pro Ala Asn Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser 
290 295 300 

55 

Ser Ser lie Glu lie lie Gly Val Pro Pro Glu Gly Gly Arg Gly He 
305 310 315 320 
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Gly Tyr lie Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser lie Asp Phe 

325 330 335 

5 

Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

10 

Ser Asn lie Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly 

355 360 365 



15 Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 

370 375 380 

Thr Cys Ser He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val 

20 385 390 395 400 



Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 

4 05 4 10 

25 

<210> 105 
<211> 1425 
<212> DNA 
30 <213> artificial sequence 





<220> 

<223> plasmid 13039 










35 


<400> 105 
atgctggcgg 


ctctggccac 


gtcgcagctc 


gtcgcaacgc 


gcgccggcct 


gggcgtcccg 




gacgcgtcca 


cgttccgccg 


cggcgccgcg 


cagggcctga 


ggggggcccg 


ggcgtcggcg 


40 


gcggcggaca 


cgctcagcat 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 




caggcgcgcc 


gcggggccag 


gttcccgtcg 


ctcgtcgtgt 


gcgccagcgc 


cggcgccatg 


45 


gccgcctccc 


tcccgaccat 


gccgccgtcc 


ggctacgacc 


aggtgcgcaa 


cggcgtgccg 


cgcggccagg 


tggtgaacat 


ctcctacttc 


tccaccgcca 


ccaactccac 


ccgcccggcc 




cgcgtgtacc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


50 


cacggcatcg 


gcggctccga 


gaacgactgg 


ttcgagggcg 


gcggccgcgc 


caacgtgatc 




gccgacaacc 


tcatcgccga 


gggcaagatc 


aagccgctca 


tcatcgtgac 


cccgaacacc 


55 


aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


tccctcatcc 


cgtacatcga 


gtccaactac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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atcgccggcc 


tctctatggg 


cggcggccag 


tccttcaaca 


tcggcctcac 


caacctcgac 


720 




aagttcgcct 


acatcggccc 


gatctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


780 


5 


ttcccggacg 


geggcaagge 


cgcccgcgag 


aagctcaagc 


tcctcttcat 


cgcctgcggc 


840 




accaacgact 


ccctcatcgg 


cttcggccag 


cgcgtgcacg 


agtactgcgt 


ggccaacaac 


900 


10 


atcaaccacg 


tgtactggct 


catccagggc 


ggcggccacg 


acttcaacgt 


gtggaagccg 


960 


ggcctctgga 


acttcctcca 


gatggecgae 


gaggccggcc 


tcacccgcga 


cggcaacacc 


1020 




ccggtgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tegaggcega 


ggactacgac 


1080 


15 


ggcatcaact 


cctcctccat 


cgagatcatc 


ggcgtgccgc 


eggagggegg 


ccgcggcatc 


1140 




ggctacatca 


cctccggcga 


ctacctcgtg 


tacaagtcca 


tegacttegg 


caacggcgcc 


1200 


20 


acctcct tea 


aggecaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1260 


ggcccgaacg 


gcaccctcat 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1320 




tacgaggagc 


agacctgctc 


catctccaag 


gtgaceggea 


tcaacgacct 


ctacctcgtg 


1380 


25 


ttcaagggee 


eggtgaacat 


cgactggttc 


accttcggcg 


tgtag 




1425 



<210> 106 

<211> 474 

30 <212> PRT 

<213> artificial sequence 



35 



40 



45 



<220> 

<223> plasmid 13039 aa 
<400> 106 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 



Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 25 30 



Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
3 5 40 45 



50 Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 

50 55 60 

Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala Met 

55 65 " 70 75 80 
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Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val Arg 

85 90 95 

5 Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser Thr 

100 105 HO 

Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly Tyr 
10 115 120 125 

Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly lie Gly 
130 135 140 

15 

Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val He 
145 150 155 160 

20 

Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro Leu He He Val 

165 170 175 

25 Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr Glu 

180 185 190 

Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu Ser 
30 195 200 205 

Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly Leu 
210 215 220 

35 

Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu Asp 
225 230 235 240 

40 

Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro Asn Thr Tyr Pro 

245 250 255 



45 Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys Leu 

260 265 270 

Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly Phe 

50 275 280 285 

Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His Val 
290 295 300 

55 

Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys Pro 
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305 310 315 320 



Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr Arg 

325 330 335 



Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn Thr 

340 345 350 



Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie Glu 
355 360 365 



lie He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He Thr 
370 375 380 



Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly Ala 
385 ^ 390 395 400 



Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He Glu 

405 410 415 



Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser Val 

420 425 430 



Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser He 
435 440 445 



Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val Phe Lys Gly Pro 
450 455 460 



Val Asn lie Asp Trp Phe Thr Phe Gly Val 



465 


470 


<210> 


107 


<211> 


1263 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


plasmid 13347 


<400> 


107 



atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc cacctccatg 60 
gccgcctccc tcccgaccat gccgccgtcc ggctacgacc aggtgcgcaa cggcgtgccg 120 
cgcggccagg tggtgaacat ctcctacttc tccaccgcca ccaactccac ccgcccggcc 180 
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cgcgtgtacc 


tcccgccggg 


ctaccccaag 


gacaagaag u 






94 0 

^* " w 


cacggcatcg 


gcggctccga 


gaacgactgg 


t tcgagggcg 


i't ^"t /""^fl 

gcggccgcgc 


3 o \r ei sx i~ 
CddLy Ly aLL 


^ n o 


gccgacaacc 


tcatcgccga 


gggcaagacc 


aagccgc tea 






J O v 


aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


t caccaagga 


^ #^ 4~ a a o 


4 9 0 

*i £t \J 


tccctcatcc 


cgtacatcga 


gtccaactac 


cccgcgtaca 


ccgaccgcga 




4. r n 

*± O \J 


atcgccggcc 


tctctatggg 


cggcggccag 


tec ttcaaca 


tcggcctcac 


caaccLcgac 




aagt tcgcct 


acatcggccc 


gatctccgcc 


gccccgaaca 


cccacccgaa 


cyaycycc uc 




t tcccggacg 


gcggcaaggc 


cgcccgcgag 


aagc ccaagc 


f— *— -H^ 4-" 4— 

CCCCCCCCdL 


^ r-> f~> f- /~t ^ /-« /-» 
cgccLyuyyu 


^ 0 

D O \J 


accaacgact 


ccctcatcgg 


cttcggccag 


cgcgtgcacg 


ageactgegu 




7 9 0 


atcaaccacg 


tgtactggct 


catccagggc 


ggcggccacg 


acctcaacgt 


gtggaagccg 


/ O \J 


ggcctctgga 


act tcctcca 


gatggccgac 


gaggccggcc 


tcacccgcga 


Cyycaacacc 


ft A 0 


ccggtgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tegaggcega 


yyaccacyac 


q n n 


ggcatcaact 


cctcctccat 


cgagatcatc 


ggcgtgccgc 


eggagggegg 


ccytyytdLu 


Q 6 0 
o V 


ggctacatca 


cctccggcga 


c tacctcgtg 


t acaagccca 


Lcgacc tcgg 


cdacyyLytc, 


109 0 

X \J ^ u 


acctccttca 


aggccaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


i r\ q a 


ggcccgaacg 


gcaccctcat 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


tacgaggagc 


agacctgctc 


catctccaag 


gtgaceggea 


tcaacgacct 


ctacctcgtg 


1200 


ttcaagggcc 


cggtgaacat 


cgactggttc 


accttcggcg 


tgtccgagaa 


ggacgaactc 


1260 


tag 












1263 



<210> 108 
<211> 420 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13347 
<400> 108 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 
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Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 45 

5 

Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



10 Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 

65 70 75 80 

His Gly lie Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 
15 85 90 95 



Ala Asn Val lie Ala Asp Asn Leu lie Ala Glu Gly Lys lie Lys Pro 

100 105 110 

20 

Leu lie lie Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly lie Ala 
115 120 125 

25 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu lie Pro 
130 135 140 

30 Tyr lie Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 

lie Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn lie Gly Leu 
35 165 170 175 



Thr Asn Leu Asp Lys Phe Ala Tyr lie Gly Pro lie Ser Ala Ala Pro 

180 185 190 

40 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

45 

Arg Glu Lys Leu Lys Leu Leu Phe lie Ala Cys Gly Thr Asn Asp Ser 
210 215 220 



50 Leu lie Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 



lie Asn His Val Tyr Trp Leu lie Gin Gly Gly Gly His Asp Phe Asn 
55 245 250 255 
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Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 

260 265 270 



Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 



Pro Ala Asn Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser 
290 295 300 



Ser Ser He Glu He He Gly Val Pro Pro Glu Gly Gly Arg Gly He 
305 310 315 320 



Gly Tyr He Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe 

325 330 335 



Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 



Ser Asn He Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly 
355 " 360 365 



Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 
370 375 380 



Thr Cys Ser He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val 
385 390 395 400 



Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val Ser Glu 

405 410 415 



Lys Asp Glu Leu 

420 



<210> 


109 


<211> 


1296 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


plasmid 11267 


<400> 


109 



atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 



60 



gcgcagtccg agccggagct gaagctggag tccgtggtga tcgtgtcccg ccacggcgtg 120 
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180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1296 





cgcgccccga 


ccaaggccac 


ccagctcatg 


caggacgtga 


ccccggacgc 


ctggccgacc 




tggccggtga 


agctcggcga 


gctgaccccg 


cgcggcggcg 


agetgatege 


ctacctcggc 


5 


cactactggc 


gccagcgcct 


cgtggccgac 


ggcctcctcc 


egaagtgegg 


ctgcccgcag 




tccggccagg 


tggccatcat 


cgccgacgtg 


gacgagcgca 


cccgcaagac 


cggcgaggcc 


10 


ttcgccgccg 


gcctcgcccc 


ggactgcgcc 


atcaccgtgc 


acacccaggc 


cgacacctcc 


tccccggacc 


cgctcttcaa 


cccgctcaag 


accggcgtgt 


gccagctcga 


caacgccaac 




gtgaccgacg 


ccatcctgga 


gcgcgccggc 


ggctcca teg 


ccgacttcac 


cggccactac 


15 


cagaccgcct 


tccgcgagct 


ggagcgcgtg 


ctcaacttcc 


cgcagtccaa 


cctctgcctc 




aagcgcgaga 


agcaggacga 


gtcctgctcc 


ctcacccagg 


ccctcccgtc 


cgagctgaag 


20 


gtgtccgccg 


actgcgtgtc 


cctcaccggc 


gccgtgtccc 


tcgcctccat 


gctcaccgaa 


atct tcctcc 


tccagcaggc 


ccagggcatg 


ccggagccgg 


gctggggccg 


catcaccgac 




tcccaccagt 


ggaacaccct 


cctc tccctc 


cacaacgccc 


agttcgacct 


cctccagcgc 


25 


accccggagg 


tggcccgctc 


ccgcgccacc 


ccgctcctcg 


acctcatcaa 


gaccgccctc 




accccgcacc 


cgccgcagaa 


gcaggcctac 


ggcgtgaccc 


tcccgacctc 


cgtgctcttc 


30 


atcgccggcc 


acgacaccaa 


cctcgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


ctcccgggcc 


agccggacaa 


caccccgccg 


ggeggegage 


tggtgttcga 


gcgctggcgc 




cgcctctccg 


acaactccca 


gtggattcag 


gtgtccctcg 


tgttccagac 


cctccagcag 


35 


atgcgcgaca 


agaccccgct 


ctccctcaac 


accccgccgg 


gcgaggtgaa 


gctcaccctc 




gccggctgcg 


aggagcgcaa 


cgcccagggc 


atgtgctccc 


tcgccggctt 


cacccagatc 




gtgaacgagg 


cccgcatccc 


ggcctgctcc 


ctctaa 







<210> 110 

<211> 431 

<212> PRT 

45 <213> artificial sequence 

<220> 

<223> plasmid 11267 aa sequence 
50 <400> 110 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 



55 



Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 

20 25 30 



302 



WO 2005/096804 



PCT/US2004/007182 



Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
35 40 45 

5 

Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

10 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu Gly 

65 70 75 80 

15 His Tyr Trp Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 

85 90 95 



Gly Cys Pro Gin Ser Gly Gin Val Ala He He Ala Asp Val Asp Glu 
20 100 105 110 

Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
115 120 125 

25 

Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 140 

30 

Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 
145 150 155 160 

35 Val Thr Asp Ala He Leu Glu Arg Ala Gly Gly Ser He Ala Asp Phe 

165 170 175 



Thr Gly His Tyr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 
40 180 185 190 

Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
195 200 205 

45 

Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
210 215 220 

50 

Cys Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 

55 He Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

245 250 255 
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Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 

260 265 270 



Ala Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 



Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His Pro 
290 295 300 



Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 

305 310 315 320 

lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 



Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 



Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 



He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 380 



Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
385 390 395 400 



Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

4 05 410 415 



Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu 

425 430 





420 


<210> 


111 


<211> 


1314 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


plasmid 11268 


<400> 


111 



atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 



60 



gcgcagtccg agccggagct gaagctggag tccgtggtga tcgtgtcccg ccacggcgtg 120 
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cgcgccccga ccaaggccac ccagctcatg caggacgtga ccccggacgc ctggccgacc 180 

tggccggtga agctcggcga gctgaccccg cgcggcggcg agctgatcgc ctacctcggc 24 0 

cactactggc gccagcgcct cgtggccgac ggcctcctcc cgaagtgcgg ctgcccgcag 300 

tccggccagg tggccatcat cgccgacgtg gacgagcgca cccgcaagac cggcgaggcc 360 

ttcgccgccg gcctcgcccc ggactgcgcc atcaccgtgc acacccaggc cgacacctcc 420 

tccccggacc cgctcttcaa cccgctcaag accggcgtgt gccagctcga caacgccaac 480 

gtgaccgacg ccatcctgga gcgcgccggc ggctccatcg ccgacttcac cggccactac 540 

cagaccgcct tccgcgagct ggagcgcgtg ctcaacttcc cgcagtccaa cctctgcctc 600 

aagcgcgaga agcaggacga gtcctgctcc ctcacccagg ccctcccgtc cgagctgaag 660 

gtgtccgccg actgcgtgtc cctcaccggc gccgtgtccc tcgcctccat gctcaccgaa 720 

atcttcctcc tccagcaggc ccagggcatg ccggagccgg gctggggccg catcaccgac 780 

tcccaccagt ggaacaccct cctctccctc cacaacgccc agttcgacct cctccagcgc 840 

accccggagg tggcccgctc ccgcgccacc ccgctcctcg acctcatcaa gaccgccctc 900 

accccgcacc cgccgcagaa gcaggcctac ggcgtgaccc tcccgacctc cgtgctcttc 960 

atcgccggcc acgacaccaa cctcgccaac ctcggcggcg ccctggagct gaactggacc 1020 

ctcccgggcc agccggacaa caccccgccg ggcggcgagc tggtgttcga gcgctggcgc 1080 

cgcctctccg acaactccca gtggattcag gtgtccctcg tgttccagac cctccagcag 1140 

atgcgcgaca agaccccgct ctccctcaac accccgccgg gcgaggtgaa gctcaccctc 1200 

gccggctgcg aggagcgcaa cgcccagggc atgtgctccc tcgccggctt cacccagatc 1260 

gtgaacgagg cccgcatccc ggcctgctcc ctctccgaga aggacgagct gtaa 1314 

<210> 112 
<211> 437 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 11268 amino acid sequence 
<400> 112 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 



Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 



305 



WO 2005/096804 



PCT/US2004/007182 



20 25 30 

Val lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
5 35 40 45 

Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

10 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu Gly 
65 70 75 80 

15 

His Tyr Trp Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 

85 90 95 

20 Gly Cys Pro Gin Ser Gly Gin Val Ala lie lie Ala Asp Val Asp Glu 

100 105 110 



Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
25 115 120 125 



Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 140 

30 

Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 
145 150 155 160 

35 

Val Thr Asp Ala He Leu Glu Arg Ala Gly Gly Ser He Ala Asp Phe 

165 170 175 



40 Thr Gly His Tyr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 

180 185 190 

Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
45 195 200 205 

Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
210 215 220 

50 

Cys Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 

55 

He Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

245 250 255 
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Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 

260 265 270 

5 

Ala Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 

10 

Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His Pro 

290 295 300 

15 Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 
305 310 315 320 

lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 
20 325 330 335 



Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 

25 

Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 

30 

lie Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 380 



35 Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 

385 390 395 400 

Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

40 405 410 415 



Phe Thr Gin lie Val Asn Glu Ala Arg lie Pro Ala Cys Ser Leu Ser 

420 425 430 

45 

Glu Lys Asp Glu Leu 
435 

50 



307 



